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Abstract 

Let h > w > be two fixed integers. Let if be a random hypergraph whose 
hyperedges are all of cardinality h. To w-orient a hyperedge, we assign exactly w 
of its vertices positive signs with respect to the hyperedge, and the rest negative. A 
(w, fc)-orientation of H consists of a w-orientation of all hyperedges of H, such that 
each vertex receives at most k positive signs from its incident hyperedges. When k 
is large enough, we determine the threshold of the existence of a (w, A;)-orientation of 
a random hypergraph. The (w, fe)-orientation of hypergraphs is strongly related to a 
general version of the off-line load balancing problem. The graph case, when h = 2 
and w = 1, was solved recently by Cain, Sanders and Wormald and independently by 
Fernholz and Ramachandran, which settled a conjecture of Karp and Saks. 



1 Introduction 



In this paper we consider a generalisation to random hypergraphs of a commonly studied 
orientation problem on graphs. An h-hypergraph is a hypergraph whose hyperedges are all of 
size h. Let h > w be two given positive integers. We consider Q n , m ,h, the probability space of 
the set of all /i-hypergraphs on n vertices and m hyperedges with the uniform distribution. 
A hyperedge is said to be w-oriented if exactly w distinct vertices in it are marked with 
positive signs with respect to the hyperedge. The indegree of a vertex is the number of 
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positive signs it receives. Let k be a positive integer. A (w, fc)-orientation of an /i-hypergraph 
is a ^-orientation all hyperedges such that each vertex has indegree at most k. If such a 
(u>, /c)-orientation exists, we say the hypergraph is (w, k)-orientable; for w = lwe simply say 
k-orientable. Of course, being able to determine the (w, fc)-orientability of an /i-hypergraph 
H for all k solves the optimisation problem of minimising the maximum indegree of a w- 
orientation of H . If a graph (i.e. the case h = 2) is (1, A;)-oriented, we may orient each edge 
of the graph in the normal fashion towards its vertex of positive sign, and we say the graph 
is k-oriented. 

Note that a sufficiently sparse hypergraph is easily (w, fc)-orientable. On the other hand, 
a trivial requirement for (w, fc)-orientability is m < kn/w, since any u>-oriented /i-hypergraph 
with m edges has average indegree mw/n. In this paper, we show the existence and deter- 
mine the value of the sharp threshold (defined more precisely later) at which the random 
/i-hypergraph Q ntm ^ fails to be (w, fc)-orientable, provided k is a sufficiently large constant. 
We show that the threshold is the same as the threshold at which a certain type of subhyper- 
graph achieves a critical density. In the above, as elsewhere in this paper, the phrase "for k 
sufficiently large" means for k larger than some constant depending only on w and h. 

The hypergraph orientation problem is motivated by classical load balancing problems 
which have appeared in various guises in computer networking. A seminal result of Azar, 
Broder, Karlin and Upfal [2] is as follows. Throw n balls sequentially into n bins, with each 
ball put into the least-full of h > 2 randomly chosen boxes. Then, with high probability, 
by the time all balls are allocated, no bin contains many more than (In In n) / In h balls. If, 
instead, each ball is placed in a random bin, a much larger maximum value is likely to occur, 
approximately Inn/ (In Inn). This surprisingly simple method of reducing the maximum is 
widely used for load balancing. It has become known as the multiple-choice paradigm, the 
most common version being two-choice, when h = 2. 

One application of load balancing occurs when work is spread among a group of computers, 
hard drives, CPUs, or other resources. In the on-line version, the jobs arrive sequentially and 
are assigned to separate machines. To save time, the load balancer decides which machine 
a job goes to after checking the current load of only a few (say h) machines. The goal is to 
minimise the maximum load of a machine. Mitzenmacher, Richa and Sitaraman [22] survey 
the history, applications and techniques relating to this. In particular, Berenbrink, Czumaj, 
Steger, and Vocking [2j [3] show an achieveable maximum load is m/n + O (log log n) for m 
jobs and n machines when h > 2. 

Another application of load balancing, more relevant to the topic of this paper, is men- 
tioned by Cain, Sanders and the second author [5]. This is the disk scheduling problem, in 
the context where any w out of h pieces of data are needed to reconstruct a logical data 
block. Individual pieces can be initially stored on different disks. Such an arrangement has 
advantageous fault tolerance features to guard against disk failures. It is also good for load 
balancing: when a request for a data block arrives, the scheduler can choose any w disks 
among the h relevant ones. See Sanders, Egner and Korst [25] for further references. 

These load balancing problems correspond to the (w, /c)-orientation problem for h- uniform 
hypergraphs, with w — 1 in the case of the job scheduling problem. The machines (bins) are 
vertices and a job (ball) is an edge consisting of precisely the set of machines to which it 
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can be allocated. A job is allocated to a machine by assigning a positive sign to that vertex. 
The maximum load is then equal to the maximum indegree of a vertex in the (w, fc)-oriented 
hypergraph. 

The work in this paper is motivated by the off-line version of this problem, in which the 
edges are all exposed at the start. This has obvious applications, for instance, in the disk 
scheduling problem, the scheduler may be able to quickly process a large number of requests 
together off-line, to balance the load better. This can be useful if there is a backlog of requests; 
of course, if backlogs do not occur, the online problem is more relevant, but this would indicate 
ample processing capacity, in which case there may be less need for load balancing in the first 
place. Trivially, the on-line and off-line versions are the same if h — 1, i.e. there is no choice. 
For h = 2, the off-line version experiences an even better improvement than the on-line one. If 
m < cn items are allocated to n bins, for c constant, the expected maximum load is bounded 
above by some constant d depending on c. 

To our knowledge, previous theoretical results concern only the case w = 1 (this also 
applies to on-line). For w = 1 it is well known that an optimal off-line solution, i.e. achieving 
minimum possible maximum load, can be found in polynomial time (0(m 2 )) by solving a 
maximum flow problem. As explained in [5], it is desirable to achieve fast algorithms that are 
close to optimal with respect to the maximum load. There are linear time algorithms that 
achieve maximum load 0(m/n) [TT | [1U | |2"T]. 

A central role in solutions of the off-line orientation problem with (w, h) = (1, 2) is played 
by the k-core of a graph, being the largest subgraph with minimum degree at least k. The 
sharp threshold for the /c-orientability of the random graph Q(n, m) = Q n ^ m ^, was found in [5], 
and simultaneously by Fernholz and Ramachandran |l4j . These were proofs of a conjecture 
of Karp and Saks that this threshold coincides with the threshold at which the ik + l)-core 
has average degree at most 2k. (It is obvious that a graph cannot be fc-oriented if it has a 
subgraph of average degree greater than 2k.) In each case, the proof analysed a linear time 
algorithm that finds a fc-orientation a.a.s. when the mean degree of the (k + l)-core is slightly 
less than 2k. In this sense, the algorithms are asymptotically optimal since the threshold for 
the algorithms succeeding coincides with the threshold for existence of a A;-orientation. The 
proof in [14J was significantly simpler than the other, which was made possible because a 
different algorithm was employed. It used a trick of "splitting vertices" to postpone decisions 
and thereby reduced the number of variables to be considered. 

During the preparation of this paper, three preprints appeared by Frieze and Melsted [12] , 
Fountoulakis and Panagiotou [13], and by Dietzfelbinger, Goerdt, Mitzenmacher, Montanari, 
Pagh and Rink [10] which independently study the threshold of (1, l)-orientability of Q n ,m,h, 
i.e. the case w = k = 1. This has applications to cuckoo hashing. However, there seems to be 
no easy way to extend the proofs in [101 H2J [13] to solve for the case k > 1, even when w = 1. 

In this paper, we solve the generalisation of the conjecture of Karp and Saks mentioned 
above, for fixed h > w > 0, provided k is sufficiently large. That is, we find the threshold 
of (w, /c)-orientation of random /i-hypergraphs in Q n , m ,h- The determination of this threshold 
helps to predict loads in the off-line w-out-of-/z disk scheduling problem, where the randomness 
of the hypergraph is justified by the random intial allocation of file segments to disks. We 
believe furthermore that the characterisation of the threshold in terms of density of a type 
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of core, and possibly our method of proof, will potentially help lead to fast algorithms for 
finding asymptotically optimal orientations. 

Our approach has a significant difference from that used in the graph case when (w, h) = 
(1,2). The algorithm used in [H] does not seem to apply in the hypergraph case, at least, 
splitting vertices cannot be done without creating hyperedges of larger and larger size. The 
algorithm used by [5], on the other hand, generalises in an obvious way, but it is already 
very complicated to analyse in the graph case, and the extension of the analysis to the 
hypergraph case seems formidable. However, in common with those two approaches, we first 
find what we call the (w, k + l)-core in the hypergraph, which is an analogue of the (k + 1)- 
core in graphs. We determine the size and density of this core when the random hypergraph's 
density is significantly larger than what is required for the core to form. This result may be 
of independent interest, and uses the differential equation method in a setting which contains 
a twist not encountered when it is applied to the graph case: some of the functions involved 
have singularities at the starting point. (See Section [3] for details.) 

Although we gain information on the threshold of appearance of this core, we do not, 
and do not need to, determine it precisely. From here, we use the natural representation 
of the orientation problem in terms of flows. It is quite easy to generalise the network flow 
formulation from the case h — 2, w = 1 to the arbitrary case, giving a problem that can be 
solved in time 0(m 2 ) for m = O(n). Unlike the approaches for the graph case, we do not 
study an algorithm that solves the load balancing problem. Instead, we use the minimum 
cut characterisation of the maximum flow to show that a.a.s. the hypergraph can be (w, k)- 
oriented if and only if the density of its (w, k + l)-core is below a certain threshold. When 
the density of the (w, k + l)-core is above this threshold, it is trivially too dense to be (w, k)- 
oriented. Even the case w — 1 of our result gives a significant generalisation of the known 
results. We prove that the threshold of the orientability coincides with the threshold at which 
certain type of density (in the case w = 1, this refers to the average degree divided by h) 
of the (w, k + l)-core is at most k, and also the threshold at which certain type of induced 
subgraph (in the case w = 1, this refers to the standard induced subgraph) does not appear. 
For the graph case, our method provides a new proof (for sufficiently large k) of the Karp-Saks 
conjecture that we believe is simpler than the proofs of [5] and [14"] . 

We give precise statements of our results, including definition of the (w, k + l)-core, in 
Section [2j In Section [3] we study the properties of the (w, k + l)-core. In Section [U we 
formulate the appropriate network flow problem, determine a canonical minimum cut for 
a network corresponding to a non-(w, /c)-orientable hypergraph, and give conditions under 
which such a minimum cut can exist. Finally, in Section [5j we show that for k is sufficiently 
large, such a cut a.a.s. does not exist when the density of the core is below a certain threshold. 

An extended abstract for this paper, omitting most proofs, will appear in STOC 2010 |16j . 

2 Main results 

Let h > w > and k > 2 be fixed. For any h- hypergraph H, we examine whether a (w, k)- 
orientation exists. We call a vertex light if the degree of the vertex is at most k. For any light 
vertex v , we can give v the positive sign respect to any hyperedge x that is incident to v (we 



4 



call this partially orienting x towards to v), without violating the condition that each vertex 
has indegree at most k. Remove v from H, and for each hyperedge x incident to v , simply 
update x by removing v. Then the size of x decreases by 1, and it has one less vertex that 
needs to be given the positive sign. If the size of a hyperedge falls to h — w, we can simply 
remove that hyperedge from the hypergraph. Repeating this until no light vertex exists, we 
call the remaining hypergraph H the (w, k + l)-core of H. Every vertex in H has degree at 
least k + 1, and every hyperedge in H of size h — j requires a (w — j)-orientation in order to 
obtain a ^-orientation of the original hyperedge in H. 

In order to simplify the notation, we use n, fh and p to denote the numbers of vertices 
and of hyperedges, and the average degree, of H G Gn,m,hi reserving n, rrih-j and // to denote 
the numbers of vertices and of hyperedges of size h — j, and the average degree, of H. 

Instead of considering the probability space Gn,m,h, we may consider Ain,m,h, the proba- 
bility space of random mult ihyper graphs with n vertices and fh hyperedges, such that each 
hyperedge x is of size h, and each vertex in x is chosen independently, uniformly at random 
from [n]. Actually ftAn,m,h m ay be a more accurate model for the off-line load balancing 
problem in some applications, and as we shall see, results for the non-multiple edge case 
can be deduced from it. For a nonnegative integer vector m = (m 2 , . . . , m^), we also de- 
fine the probability space A4 n ,m, being the obvious generalisation of M. n , m ,h to non-uniform 
multihypergraphs in which m 8 is the number of hyperedges of size %. 

All our asymptotic notation refers to n — » oo. For clarity, we consider H G M.n,m,h- We 
use n, mh_j and /i for the number of vertices, the number of hyperedges of size h — j and the 
average degree of H. We parametrise the number fh of edges in the hypergraphs under study 
by letting p = p{n) denote hfh/n, the average degree of H G M.n,m,h ( or of H G Qn,m,h)- 

Our first observation concerns the distribution of H and its vertex degrees. Let Multi(n, m, k+ 
1) denote the multinomial distribution of n integers summing to m, restricted to each of the 
integers being at least k + 1. We call this the truncated multinomial distribution. 

Proposition 2.1 Let h > w > 1 be two fixed integers. Let H G Ain,m,h ^nd let H be its 
(w, k + l)-core. Conditional on its number n of vertices and numbers rrih-j of hyperedges of 
size h — j for j = 0, . . . , w — 1, the random hypergraph H is distributed uniformly at ran- 
dom. Furthermore, the distribution of the degree sequence of H is the truncated multinomial 
distribution Multi(n, m, k + 1) where m = ^2jZ (h — j)mh-j. 

The following theorem shows that the size and the number of hyperedges of H are highly 
concentrated around the solution of a system of differential equations. The theorem covers 
the cases for any arbitrary h > w > 2 and holds for all sufficiently large k. The special case 
w = 1 has been studied by various authors and the concentration results can be found in [5j 
Theorem 3] which hold for all k > 0. Since w and k are fixed, we often omit them from the 
notation. 

Theorem 2.2 Let h > w > 2 be two fixed integers. Assume that for some constant c > 1 we 
have ck < p where p = hfh/n. Let H G Ain,m,h and let H be its (w, k + l)-core. Let n be the 
number of vertices andrrif l _j the number of hyperedges of size h — j of H. Then, provided k is 
sufficiently large, there are constants a > and (3h-j > 0, defined in (TX3]) below, depending 
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only on /z ; k, w and h, for which a.a.s. n ~ an and m h _j ~ fih-j n for {) < i < w — 1. The 
same conclusion (with the same constants) holds for H e Qn,m,h- 

Note. The full definition of a and fth-j m t ne theorem is rather complicated, involving the 
solution of a differential equation system given below in (l3.4H3.14j) . 

Let V be a hypergraph property and let M. n ,m,h £ V denote the event that a random 
hypergraph from M n , m ,h has the property V. Following [TJ Section 10.1, Definition 4], we say 
that V has a s/iarp threshold function f{n) if for any constant e > 0, P(M. n ,m,h & V) 1 
when m < (1 — e)/(n), and P(A / f njmi / l G T 3 ) — >■ when m > (1 + e)f(n). 

Let ft(if) denote X)7=o — j) m h-j/ n , which we call the w-density of if. We similarly 
define the w-density of any hypergraph all of whose hyperedges have sizes between h — w + 1 
and h. It helps to notice, by the definition of w-density, that 

tik(H) = d(H) — (h — w)m, 

where d(H) denotes the degree sum of H and m = YlJ=o m h-j- We say that a hypergraph 
H has property T if k(H) < k, where H is the (w, k + l)-core of H. The following theorem, 
proved using Theorem 12. 2\ immediately gives the corollary that there is a sharp threshold 
function for property T. 

Theorem 2.3 Let H 6 -Mn,m,/i- £e£ p, be the average degree of H and let H be the (w,k + l)- 
core of H . Then for all sufficiently large k, there exists a strictly increasing function c(p) of 
Ji, such that for any fixed c 2 > C\ > 1 and for any C\k < ft < c 2 k, a.a.s. k(H) ~ c(/2). 

Corollary 2.4 There exists a sharp threshold function f(n) for the hypergraph property T in 
•M.n,m,h an d Gn,m,h provided k is sufficiently large. 

The function c(/x) in the theorem, and the threshold function in the corollary, are determined 
by the solution of the differential equation system referred to in Theorem 2.1. 

We have defined a (w, fc)-orientation of a uniform hypergraph in Section [TJ We can sim- 
ilarly define a (w, A;)-orientation of a non-uniform hypergraph G with sizes of hyperedges 
between h — w + 1 and h to be a simultaneous (w — j)-orientation of each hyperedge of size 
h — j such that every vertex has indegree at most k. By counting the positive signs in orien- 
tations, we see that if property T fails, there is no (w, fc)-orientation of H, and hence there is 
no (w, &;) -orient at ion of H. 

For a nonnegative integer vector m = (rrih~ w +i, . . . , rrih), let Ai(n, m, k + 1) denote -M n ,m 
restricted to multihypergraphs with minimum degree at least k + 1. By Proposition I2.1[ 
Ai(n, m, k + 1) has the distribution of the (w, k + l)-core of H e Aifi,m,h conditioned on the 
number of vertices being n and the number of hyperedges of each size being given by m. To 
emphasise the difference, we will use G to denote a not-necessarily-uniform hypergraph in 
cases where we might use H for a uniform hypergraph. 

Given a vertex set S, we say a hyperedge x is partially contained in S if \x H S\ > 2. 

Definition 2.5 Let < 7 < 1. We say that a multihypergraph G has property A{^) if for 
all S C V(G) with \S\ < 7|V(G)| the number of hyperedges partially contained in S is strictly 
less than k\S\/2w. 



6 



In the following theorem, m = m(n) denotes an integer vector for each n. 

Theorem 2.6 Let 7 be any constant between and 1. Then there exists a constant N > 
depending only on 7, such that for all k > N and any e > 0, if m(n) satisfies Y^j=o ( w ~~ 
j)rrih-j{n) < kn — en for all n, then G £ M. (n, m(n), k + 1) a.a.s. either has a (w,k)- 
orientation or does not have property A(^) . 

Let f(n) be the threshold of property T given in Corollary [231 We show in the forthcoming 
Corollary 14.31 that for certain values of 7, a.a.s. H has property ^.(7) if the average degree of 
H is at most hk/w. We will combine this with Corollary 12.41 and Theorem 12.61 and a relation 
we will show between A4n,m,h and Gn,m,h (Lemma 13. ip . to obtain the following. 

Corollary 2.7 Let h > w > be two given integers and k be a sufficiently large constant. 
Let f{n) be the threshold function of property T whose existence is asserted in Corollary \2.4\ 
Then f(n) is a sharp threshold for the (u>, k)-orientability of M.n,m,h and Gn,m,h- 

For any vertex set 5* C V(H), define the subgraph w-induced by 5* to be the subgraph of 
G on vertex set 5* with the set of hyperedges {x' — sfl S : x £ H, s.t. \x'\ > h — w + 1}. Call 
this hypergraph Hs- It helps to notice that H is the largest w-induced subgraph of H with 
minimum degree at least k + 1. From the above results and a relation we will show between 
■Mn,m,h and Qn,fh.hi we will obtain the following. 

Corollary 2.8 The following three graph properties of H £ M.n,m,h (orQ nt m,h) have the same 
sharp threshold. 

(i) H is (w , k)-orientable. 

(ii) H has property T ■ 

(Hi) There exists no w-induced subgraph H' C H with n(H') > k. 

3 Analysing the size and density of the (w, k + l)-core 

A model of generating random graphs via multigraphs, used by Bollobas and Frieze [1] and 
Chvatal [S], is described as follows. Let 7\m be the probability space of functions g : [in] x 
[2] — > [ft] with the uniform distribution. Equivalently, Vn,m can be described as the uniform 
probability space of allocations of 2rh balls into n bins. A probability space of random 
multigraphs can be obtained by taking {g(i, 1), g(i, 2)} as an edge for each i. This model 
can easily be extended to generate non-uniform random multihypergraphs by letting m = 
(m 2 , . . . , m h ) and taking V n ,m = {g '■ uf =2 [mj] x [i] — >• [n]}. Let A4n,m be the probability space 
of random multihypergraphs obtained by taking each {g(j, 1), . . . , g(J, i)} as a hyperedge, 
where j £ [rrii] and 2 < i < h. (Loops and multiple edges are possible.) Note that Ain,m, 
where m = (m 2 ) = (m), is a random multigraph; it was shown in [8] that if this is conditioned 
on being simple (i.e. no loops and no multiple edges), it is equal to ^ s ,m,2, and that the 
probability of a multigraph in M.n,(m) being simple is f2(l) if rh = 0{n). This result is easily 
extended to the following result, using the same method of proof. 
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Lemma 3.1 Assume h > 2 is a fixed integer and m = (7712, . . . , m/J a non-negative integer 
vector. Assume further that ^^=2 m « = 0{n) . Then the probability that a hypergraph in Ain,m 
is simple is Q(l). 

Cain and Wormald [6] recently introduced a related model to analyse the fc-core of a 
random (multi)graph or (multi) hypergraph, including its size and degree distribution. This 
model is called the pairing- allocation model. The partition- allocation model, as defined below, 
is a generalisation of the pairing-allocation model, and analyses cores of multihypergraphs with 
given numbers of hyperedges of various sizes. We will use this model to prove Theorem 12.61 
and to analyse a randomized algorithm called the RanCore algorithm, defined later in this 
section, which outputs the (w, k + l)-core of an input /i-hypergraph. 

Given h > 2, n, m = (m 2 , . . . , m h ), L = (/ 2 , ■ ■ ■ ,lh) an d a nonnegative integer k such 
that D — £ > kn, where D = X^=2* m « an d £ — X^=2^' ^ ^ be a set of n bins, and M a 
collection of pairwise disjoint sets {Mi, . . . , Mh}, where Mi is a set of irrii balls partitioned 
into parts, each of size i, for all 2 < i < h. Let Q be an additional bin to V. It may assist 
the reader to know that Q 'represents' all the hyperedge incidences at vertices of degree less 
than k, and li is the number of these incidences in edges of size i. The partition- allocation 
model V(V, M, L, k) is the probability space of ways of allocating balls to bins in the following 
way. Let C = {02, . . . , Ch} be a set of colours. Colour balls in Mi with q. (The function of 
the colours is only to denote the size of the part a ball lies in.) Then allocate the D balls 
uniformly at random (u.a.r.) into the bins in V U {Q} , such that the following constraints 
are satisfied: 

(i) Q contains exactly I balls; 

(ii) each bin in V contains at least k balls; 

(iii) for any 2 < i < h, the number of balls with colour that are contained in Q is U. 

We call Q the light bin and all bins in V heavy. To assist with the analysis in some 
situations, we consider the following algorithm which clearly generates a probability space 
equivalent to V(V, M, L, k). We call this alternative the allocation-partition algorithm since 
it allocates before partitioning the balls. First, allocate D balls randomly into bins {Q} U V 
with the restriction that Q contains exactly i balls and each bin in V contains at least k balls. 
Then colour the balls u.a.r. with the following constraints: 

(i) exactly irrii balls are coloured with q; 

(ii) for each i = 2, . . . , h, the number of balls with colour Cj contained in Q is exactly U. 

Finally, take u.a.r. a partition of the balls such that for each i — 2, . . . , h, all balls with colour 
Cj are partitioned into parts of size i. 

To prove Theorem 12.21 we will convert the problem to a question about V(V, M, L, k + 1), 
in particular the (w,k + l)-core of the hypergraph induced in the obvious way by the bins 
containing at least k + 1 balls. 
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A deletion algorithm producing the fc-core of a random multigraph was analysed in [6]. 
The differential equation method [27] was used to analyse the size and the number of hyper- 
edges of the final fc-core. The degree distribution of the fc-core was shown to be a truncated 
multinomial. We now extend this deletion algorithm to find the (w, fc + l)-core of H in Ain,m,h 
and Gn,m,h- We describe the algorithm in the setting of representing multihypergraphs using 
bins for vertices, where each hyperedge x is a set h(x) of \x\ balls. Initially let LV be the set 
of all light vertices/bins, and let LV = V(H) \ LV be the set of heavy vertices. A light ball 
is any ball contained in LV. 

RanCore Algorithm to obtain the (w,k+ l)-core 
Input: an /i- hypergraph H. Set t := 0. 
While neither LV nor LV is empty, 
t :=t + 1; 

Remove all empty bins; 

U.a.r. choose a light ball u. Let x be the hyperedge that contains u and let v be 

the vertex that contains u; 

If \x\ > h — w + 2, update x with x \ {u}, 

otherwise, remove this hyperedge x from the current hypergraph. If any vertex 
v' G LV becomes light, move v' to LV together with all balls in it; 
If LV is empty, ouput the remaining hypergraph, otherwise, output the empty graph. 

We will prove Proposition 12.11 and Theorem 12.21 by analysing the RanCore algorithm 
using the partition-allocation model and the allocation-partition algorithm which generates 
P(y,M,0,k + l). Define 

/^) = £^.£ = 1-|>-^, (3.1) 

i>k 8=0 

for any integer k > 0. By convention, define = 1 for any k < 0. Let Z(>]A be a truncated 

Poisson random variable with parameter A defined as follows. 

P(Z {>k) = j) = — — • — , for any j > k. (3.2) 

Note that it follows that P(Z(>^) = j) = whenever j < k. The following proposition will be 
used in the proof of Theorem 12.21 and in Section [5j 

Proposition 3.2 For \i > k + 2, there exists a unique real A satisfying A/^(A) = fifk+i(\). 
Moreover, \i > A, and if \i > ck for a fixed c > 1, then \i — A — > and /^(A) — > 1 as k — >■ oo. 

Proof. Since y fk(y) / fk+i(y) is monotonic in the domain y > 0, as shown in [231 Lemma 1], 
there exist a unique A > that satisfies A/&(A) = /x/fe+i as long as /i > infj,>o{y/fe(y)//jfe+i(y)}. 
Clearly / fc (l)// fc+1 (l) </c + 2, so;U>A;-|-2 suffices. Since /&(A) > //t+i(A) for all k > — 1 
by the definition of the function /fc(x) in (13.1 1) , it follows directly that \i > X. For k — > oo, 
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we use well known simple bounds on tails of the Poisson distribution. Set c = 1 + 3a, where 
a > 0. If A < k + ak, then 



Lfc+2afeJ . +1 [k+2ak] j+l 

Xf k (X) ~ Yl e-"^—<(l + o(l))(k + 2ak) £ e^-^— - 
j=k J ' j=k V " l ~ '• 

< ckf k+1 (X) < fif k+1 (X), 

a contradiction. So A > + a/c, whence f k {X) and fk+i(X) are both 1 + o(l/A) and so 
A-// = o(l). I 

The following is essentially [281 Lemma 4.2]. 

Lemma 3.3 Let c > 0, 5 be constants. Let (Y t ) t >i be independent random variables such that 
\Y t \ < c always and EY t < S for all t > 1. Let X = and X t = Yli<t ^ f or a ^ t — 1- Then 
for any e > 0, a.a.s. X n < 5n + e|<5|rz. More precisely, P(X n > 5n + e|5|n) < exp(— Q(e 2 n)). 

Proof of Proposition 12.11 Consider an element H e Ai n m arising from P e Vn, m , where 
m = (0, . . . , 0, rh) with fh corresponding to the value of the coordinate m^. If we merge all 
the bins of P containing k or fewer balls into one bin Q, we obtain in an obvious way an 
element P' G V(V, M, L, k + 1) for an appropriate sequence L = (0,0,..., L ) where L is 
the total degree of light vertices (vertices with degree at most k) in H. Given the parameters 
(V, M, L, k + 1), the number of P that will produce P' is independent of P'. It follows that, 
conditional on the total degree of the light vertices in H, this generates V(V, M, L, k + 1) with 
the correct distribution. Moreover, the hypergraph induced by the vertices of H of degree at 
least k + 1 is also induced in the obvious way by the heavy bins of P' . Hence, it suffices to 
study the (w, k + l)-core of this hypergraph, conditional upon any feasible L. Because of the 
correspondence between V(V, M, L, k + 1) and the random mult ihyper graphs, we sometimes 
call bins in V vertices and the degree sequence of V denotes the sequence of numbers of balls 
in bins in V. 

To this end, we adapt the RanCore algorithm in the obvious way to be run on V(V, M, L, k+ 
1), as follows. In each step t, the algorithm removes a ball, denoted by u, u.a.r. chosen from 
all balls in Q. If the colour of u is Ch-j for % < w — 1, the algorithm recolours the balls in 
the same part as u with the new colour Ch-j-i- If the colour of u is Ch- w +i, the algorithm 
removes all balls contained in the same part as u, and if any heavy bin becomes light (i.e. the 
number of balls contained in it becomes at most k) because of the removal of balls, the bin 
is removed and the balls remaining in it are put into Q. This clearly treats the heavy bins of 
P 1 in a corresponding way to RanCore treating the heavy vertices of H. Thus, the modified 
RanCore stops with a final partition-allocation that corresponds to the (w, k + l)-core of H, 
and this is what we will analyse. 

For easier reference, let g t denote the random partition-allocation derived after t steps 
of this process. Let V t denote its set of heavy bins and let M t denote the class of sets 
{Mt t h-w+i, ■ ■ ■ ,M ty h} such that M tt h-j denotes the set of partitioned balls with colour Ch-j 
in g t for < j < w - 1. Let m t ^j = \M tth -j\/(h - j) and m = (m tth _ w+1 , . . .,m tjh ). Let 
L^h-j denote the number of balls with colour Ch-j in Q and let L = (L^_„, +1 , . . . , L tjh ). Let 
L t = 12j=o L t,h-j- Initially, £f = P', V = V etc. 
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There is a straightforward way to see, by induction on t, that the partition-allocation 
g t , conditional on V t , M t and L t , is distributed as V(y t , M t , h t , k + 1). We have already 
noted that this is true for t = 0. For the inductive step, it suffices to note that that for any 
t > 0, conditional on the values of Vt, Mt, ~L t for gt, the probability that gt+i is any particular 
member g' of V t+ i, Mf+i, L t+ i does not depend on g' . This is because of three facts. Firstly, 
g t is uniform conditional on the parameters at step t. Secondly, the change in the parameters 
determines which type of step the algorithm is taking (e.g. if a heavy bin becomes light). 
Thirdly, each possibility for g t+ i is reachable from the same number of g t and, given the type 
of step occurring, each such transition has the same probability of occurring as step t + 1. 

Furthermore, it is easy to see that the degree distribution of the {w, fc + l)-core, conditional 
on the number of hyperedges of each size, is truncated multinomial. This is because, for any 
V and M, the allocation-partition algorithm which generates V(V, M, 0, k + 1) produces a 
truncated multinomial distribution for the degrees of vertices in V. I 

The proof of Theorem 12.21 uses the differential equation method (d.e. method). In partic- 
ular, we use the following special case of [281 Theorem 6.1]. For each n > let a sequence of 
random vectors , . . . , Y t )o<t<m be defined on a probability space Q n . (We suppress the 
notation n.) Let Ut denote the history of the process up to step t. 

Theorem 3.4 Suppose that there exists C > such that for each i, < Cn always. Let 
D C M! +1 and let the stopping time T be the minimum t such that (t/n, Y^ /n, . . . , Y^ 1 ') £ D . 
Assume further that the following three hypotheses are satisfied. 

(a) (Boundedness hypothesis.) There exists a constant C > such that for all < t < 
min{m,T} ; \Y t+ i — Y t \ < C always; 

(b) (Trend hypothesis.) There exists functions fi for all 1 < i < I such that for all < t < 
min{m, T} and all 1 < i <l, 

E(y$ - F« | U t ) = m/n, Y t W /n, Y t {l) /n) + (1); 

(c) (Lipschitz hypothesis.) For every 1 < i < I, the functions fi are Lipschitz continuous 
in all their variables on a bounded connected open set V where V contains the intersec- 
tion of (t, z^\ . . . , : t > 0) with some neighbourhood of (0, z^\ . . . , : P(Y^ = 
z^'n, 1 < i < I) 7^ for some n). 

Then the following conclusions hold, 
(a) For any (0, z^\ . . . , z®) £ V, the differential equation system 

^T 1 = fi{s,z x ,...,zi), i = l,...,l 
a s 

has a unique solution in V for zi : M —> M with the initial conditions 

z i (0)=z^, i = l,..., I, 
where the solution is extended arbitrarily close to the boundary ofT>. 
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(b) A.a.s. 

Y t (l) = nZi(t/n) + o(n) 

uniformly for all < t < min{cm, T}, where a is the supremum of all x such that the 
solution {z^{x))i<i<i to the differential equation system lies inside the domain T>. 

Our usage of a.a.s. in conjunction with other asymptotic notation such as oQ conforms to 
the conventions in [29]. For more details of the method and proofs, readers can refer to [27J 
Theorem 1], [28, Theorem 5.1] and [28, Theorem 6.1]. In our case, Q n is the probability space 
of sequences of random partition-allocations generated by running the RanCore algorithm on 
graphs with n vertices, where t refers to the t-th step of the algorithm and the Y t are variables 
defined during the algorithm.) 

The idea of the proof of Theorem 12.21 is, roughly speaking, as follows. We use the d.e. 
method to analyse the asymptotic values of random variables defined on the random process 
generated by the RanCore algorithm. The difficulty arises from the fact that the natural 
functions fi for our application are not Lipschitz continuous at x = 0. To avoid this, we 
artificially modify the fi in a neighbourhood of the problem point, and show that the solution 
to the new differential equation system coincides with the original inside a domain Dq which 
contains all points relevant to the random process. Theorem 13.41 then applies to show that 
the asymptotic values of the random variables inside D Q are approximated by the solution of 
the system, we analyse the random variables when they leave D . We show that provided k 
is sufficiently large, the algorithm then terminates quickly, which allows us to estimate the 
size and density of H. 

Proof of Theorem 12.21 It was shown in the proof of Proposition 12.11 that for every 
t, conditional on the values of V t , M t and L t , the partition-allocation g t is distributed as 
V(V t , M t , h t , k + 1). After step t of the RanCore algorithm, define (or recall) the following 
random variables: 

B t total number of balls remaining 

B t h _j number of balls coloured Ch-j 

A tji number of bins containing exactly i balls 

At A^k+i (number of bins containing exactly k + 1 balls) 

L t number of light balls 

Lt,h-j number of light balls that are coloured Ch-j 

H t} h-j number of balls contained in heavy bins that are coloured Ch-j 

HV t | V t | , number of heavy bins 

and note that H t ^-j = B t ^-j — ^t,h-j- 

Recall that n and rrih-j denote the number of vertices and the number of hyperedges of 
size h — j in H, the (w, k + l)-core of H, and ft denotes its average degree. We show that 
if p, > ck for some c > 1 and k is sufficiently large, then a.a.s. n ~ an and m^-j ~ Ph-jn 
for some constants a > 0, f3h-j > which are determined by the solution of the differential 
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equation system given below, on a domain D Q denned below f 1 3 . 1 4 j) . In particular, we will 
show that 

a = z HV (x*), [3 h _j = z HA _j(x*)/ (h - j), (3.3) 

where x* is the smallest positive root of zl(x) = 0. 

The d.e. method relies on a relation between solutions of a differential equation system 
and the random variables of the process under consideration. We will use subscripts of the 
real valued functions to indicate their corresponding random variables. For instance, the real 
function zi,h-j{x) is associated with the random variable L t) h-j- The differential equation 
system is as follows. 

J f„\ - ZL > h ~j ( 1 ( h ~j- 1 ) Z L,h-j 

z L,h-j\ x ) — — l _i : 

ZL \ z B,h-j 

ZL,h-w+l f (h — w)zH,h-w+l (k + 1)za ^ z H,h-j 



z H,h-j( x ) 



ZL \ ZB,h-w+l Z B — Z L %B — %L, 

+ z L ,H- j+ i MKjH ± l ; j = !,...,«,- 1, (3.4) 

Z L ZB,h-j+l 

z L ,h-j ( (h-j- l)z H ,h-i 



Z L \ ZbK- 



ZL,h-w+l ( {h — w)Z}i,h-w+l (fc + l) 



-J 



■ k 

Z L V Zg^h-w+l z b — Zl Z B — Z L 

z L,h~j+l (h - j)zn,h-j+ 



ZL ZB,h-j+l 



-, j = l,...,w -1, (3.5) 



z' L (x) = -1 + ZLA - W+1 {- {H ~ W > L '^ + (ft - w)k ■ ZH < h ~ w+1 ■ {k + 1)ZA \.6) 



zl \ zs,h-w+i ZB,h-w+i zb — zl / 

z' B {x) = -l- (/l ~ w) " L ^ +1 (3.7) 

zl 

, z L ,h-w+i (h - w)z Hth _ w+ i (k + l)z A ,„ o\ 

Z HV\ X ) ~ ' {O.tS) 

zl ZB,h-w+i zb — zl 

w/ x ((z'b ~ z 'l) z hv - {zb - z L )z' HV )f k+ i{\) 

4^(A) + A e ^.^-^f. e -A.f) 

IB— 1 W— 1 

z L ,h(x) = z L (x) - ^2z L ,h-j{x), z H;h (x) = z B (x) - z L (x) -^z^h-jix), (3-10) 

i=l i=l 

z B>h -j{x) = z L , h -j(x) + z H>h -j(x), for every < j < w - 1, (3.11) 

A(a;) fe+1 

za(x) = x(x)f , ——z HV (x), (3.12) 

where /fe(A) was defined in (13. ip . The initial conditions are 

Zfl(0) = fl, ZL,h-j(0) = 0, z H ,h-j{0) = 0, for all 1 < j < w - 1, (3.13) 

k 

z L (0) = Jx{l-f k {jx)), ^y(0)=l-exp(-/i)^/27i!, A(0)=/2. (3.14) 

i=0 
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Let D be the domain which contains all points such that x G R, < zi,h-j < z B,h-j> 
ZL,h-j < z l for all < j < w - 1, 2i > 0, 2 B - z L > 0, z H y > and (z B - z L )/z HV > k + 2. 
We will call the right hand sides of fl3.4H3.9p the derivative functions, and at present we regard 
them to be only defined in Dq. It is straightforward to check that for any point z* G -Do such 
that ZL,h-j = z B,h-j = 0, the functions specified in the right hand sides of f l3.4H3.8p tend to 
when z approaches z* from the interior of D . For example, note that the term 

ZL z B,h-j 

on the right hand side of ( 13. 4 p is bounded above by (h—j — l)z Lh _j/zi since \z Lh ^j/z Bjh _j\ < 1 
when z G D . Hence, it tends to if z — » z*. The same applies to similar terms in fl3.4p - fl3.8p . 
As part of our definition of the differential equation system (I3.4l) - fl3.14p . we now declare the 
values of these terms at such points z* to be 0. 

In applying Theorem 13. 4[ the variable x will be associated with t/n. As mentioned above, 
the variable z^^-jix) is associated with the variable L t ,h-j/n, which we call the scaled version 
of the random variable L t ^-j- We do the same for the other random variables, and call t/n 
the scaled version of t. 

There are two kinds of problems with the Lipschitz property required in Theorem 13.41 (c). 
The first is caused by terms in the equations with denominators zl or z B — z^ appearing in 
the derivative functions, which are potentially 0, causing singularities. These are relatively 
easy to take care of since they do not become small until near the end of the process. For any 
fixed constant e > 0, define -Do(e) to be the connected subset of Dq obtained by restricting 
to z such that zl > e and z B — zl > e. We will basically restrict consideration to points in 
D (e). Let T be the (stopping) time that the vector of scaled random variables leaves D (e). 
Let t AT denote min{t, T}. The conclusion of Theorem 13.41 will give information on the scaled 
random variables up to the step when they reach the boundary of the domain Dq(e). This 
gives us information about (jiat)o<Kt) where g t is the partition-allocation obtained after step 
t. At that point we will need some further observations to show that the process finishes soon 
afterwards. 

The second type of problem comes from denominators containing z B> h_j, which can be 
even right at the start of the process. This poses a difficulty since the theorem requires 
the derivative functions to be Lipschitz in an open domain containing the starting point. To 
deal with this, we will, at an appropriate point below, extend the differential equations into 
a larger connected open domain D D D , and correspondingly extend -D (e) to D(e). We will 
actually apply Theorem 13.41 with V = D(e) and V = D (e). 

We first verify hypotheses (a) and (b) of Theorem 13.41 which are unrelated to the choice of 
D(e). It is easy to see that the change of each random variable in every step of the algorithm 
is bounded. This is because in every step, the number of balls deleted (or recoloured, or 
moved from heavy bins to the light bin Q) is bounded. Thus, Theorem 13.4( a) clearly holds. 

To verify hypothesis (b), we will need to show that the expected one-step change of each 
random variable, such be approximated to within o(l) error by some function 

of the scaled variables. Replacing the scaled variables in these functions by their associated 
real variables will give the derivative functions in ( I3.4p - ()3.8p . 
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Let g t be the partition allocation obtained after step t. At step t + 1, a partition-allocation 
g t+ i is to be obtained by applying the RanCore algorithm to g t . Let v be the ball randomly 
chosen by the algorithm from Q. Let C(v) be the colour of v , so C(v) = h — j for some j. 
If j < w — 1, the algorithm removes another h — j — 1 balls that are uniformly distributed 
among all balls with colour Ch-j since g t G V(V t , M t , h t , k + 1) as proved in Proposition 12. II If 
j = w — 1, then the algorithm removes v together with another h — w balls which are chosen 
u.a.r. from all balls of colour Ch- w +i- If the removal of the h — w balls results in some heavy 
bins turning into light bins, these bins are removed and the balls remaining in these bins are 
put into Q. 

Now we estimate the expected value of L t +i^-j — L^h-j f° r an Y 1 < J < ^ — 1 and for 
any < t < r conditional on Vt, M t , L t and the event g t G V(V t , M t , L t , fc + 1). Given j, the 
probability that C{v) = c^-j is L t)h _j/L t . If C(v) = Ch-j, one ball of colour Ch-j contained 
in Q is removed, and another h — j — 1 balls of colour Ch-j are recoloured with Ch-j-i (or 
removed if j = w — 1). So the expected number of those balls that are contained in Q is 

(, "V 1)L "- J (i + o(D). 

provided B^-j > logra (say). Hence 

is the negative contribution to E(L t+ i^_j — Lt^^j | 14, M t , L t , g t E V{V t ,'M t ,L t ,k + l)). Note 
that we reach the same conclusion if B t) h~j < log n because in that case 

L tth -j/L t < B tA _j/L t < log n/en = o(l). 

The positive contribution to E(Lt + i^_j — L tt t-j \ Vt, M t , L t , ^ G "P(Vt, M t , L t , /c + 1)) comes 
from the following two cases. 

Case 1: C(v) = Ch- w +i- Here, the algorithm removes v and another h — w balls of colour 
Ch-w+i- P(C(v) = h — w + 1) = L t;h _ w+ i/ L t . We first note that, for any 2 < i < h — w, the 
contribution from the case that i of the h — w removed balls lie in a bin containing at most 
k + % balls is at most ((/c + (B t — L t )Y 1 = o(l), since the definition of V = D (e) ensures 
that the denominator is at least en for t < T. 

It only remains to consider the contribution from the case that a ball in a bin containing 
exactly k + 1 balls is removed. For each ball removed, the probability that it is in a bin 
containing exactly k + 1 balls is 

Ht,h-w+i (k + l)A t)k+1 
B t B t -U 

The removal of such a ball causes the bin to become light. Since g t G V(V t , M t , h t , k + 1), the 
balls of each colour are uniformly distributed among all balls in the heavy bins, and thus the 
expected number of balls of colour Ch-j, for < j < w — 1, among the remaining k balls in 
the bin is 

k-^j- + o(l). 
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In total, h — w balls of colour Ch- w +i are removed, other than v. Hence the expected contri- 
bution to E{L t+ljh ^ - L t>h _j | V t , M t , L t , g t G V{V t , M t , L t , k + 1)) is 

(h-w) k-- j- + o{\). 

L>t -£>t,h-w+l E>% — J-t D t — Lit 

Case 2: C(v) = Ch-j+i- The algorithm removes v, chooses another h — j balls u.a.r. from 
those of colour Ch-j+i, and recolours them with Ch-j. Since P(C(v) = Ch-j+i) = L^h-j+i/L t , 
conditional on C(v) = Ch-j+i, the expected number of balls of colour Ch-j+i that are in the 
light bins and are recoloured is 

L>t,h-j+l 

provided B t)h _j + i >logn. Hence the positive contribution to E(L t+ i >h _j—L tl h-j | V t ,TsA t , L t , g t G 
V{V t ,M t X t ,k + l))is 

hht^l. {h _j). ^±l + o(l) 

Lit L>t,h-j+l 

in this case. The same conclusion holds when B tt h-j+i < log n for the same reason as discussed 
before. Therefore 

E(L t+lj ^ - L t)h _j | V t , M t , L t , g t G V(V t , M t , L t , k + 1)) 

L t,h-j ( _ x _ (h-j- l)L t , h -j \ + Ltfi-j+i (h - j)Lt, h -j+i 



Lt V Bt,h-j J L t B ti h-j+i 

L/t,h-w+l 

f(h-w)H t 

,h-w+l {k + l)A t ]c+i Ht,h-j 



Lt V Bt,h-v>+i B t — L t B t — L t ) 

for j = 1, . . . ,w — 1. Replacing the random variables in the right hand side of (13. 15)) by 
their associated real variables (noting that the scaling cancels out) gives the right hand side 
of (13.4)) . Using a similar approach to computing the expected changes of H t ^-j, B t , D t , 
HVt, conditional on Vt, M 4 , L 4 and the event gt G V{Vt, M t , L 4 , k + 1), we easily obtain the 
derivative functions in (13.5)) — ( |3~8]) . The equations 

w—1 w—1 

Lt,h = L t — L^h-j, H t) h = B t — L t — H ty h-j, 

1=1 i=l 

B t> h-j = L t ,h-j + H t ,h-j, for every h-w + 1 < j < h 

are obvious and lead to (13. 10)) and (13.11)) . 

Let fi t denote (B t — L t )/HV t , the average degree of heavy vertices after step t. Corre- 
spondingly we define a function fi(x) associated with the random variable fi t to be 

fi(x) = (z B (x) - z L (x))/z HV (x). (3.16) 



Then by Proposition 13.2) we may define \(x) by 

\(x)f k (\(x))=ii{x)f k+1 {\(x)) (3.17) 
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provided that > k + 2, which is guaranteed inside D (e). Let X t = X(t/n), so that X t is 
the unique positive root of 

^tfk(Xt) , v 

7 — nTT~^ = - ^ 3 - 18 ) 

Since gt G V(Vt, M t , L 4 , A; + l) for every t, by considering the allocation-partition algorithm 
that generates V{V t , M t , L t , /c + l), the degree sequence of the heavy vertices has the truncated 
multinomial distribution. Hence, by [61 Lemma 1], 

-At \k+l 

where A 4 satisfies ( 13.1 8p . This gives (13. 14j) . 

Now ( 13. 9p . which gives the derivative of A(x), follows by taking the derivative of both sides 
of fl3H]), 



X\x)f k (X(x)) + X(x^ i,AX) 



X'(x) = /i'(x)f k+1 (X(x)) + n{x 

\=\{x) 



df k +i(X) 



dX 

where, by the definitions of fk(X) in ( 13. ip and fi(x) in ( 13. 16ft 



dA 



A'(x), 

A=A(x) 



^(A) _ A A fc 1 _ (z' B - z' L )z HV - (zb - z L )z' HV 

~ltx~- e Tk^Yy. and Mx) - 1Yv ' 

Now we justify hypothesis (c). We will first extend the derivative functions (which, up 
until this point, we restricted to D ) into a larger domain D, which defines an extended d.e. 
system, and show that these extended functions are continuous and Lipschitz inside an open 
domain D(e) extended from D (e). Later we will show that the solution of the extended d.e. 
system for < x < T/n, with the same initial conditions as the original system, is contained 
inside the domain Do(e) and is thus the solution to the original d.e. system. 

We begin with the domain Do(e), which was defined by restricting the points in Do to 
z L > e and zb~zl > e. Recalling our treatment of the possible singularity ZL^-j = z B,h-j = 
just after (I3.14p . each derivative function is continuous in D (e). The only potential problems 
for the Lipschitz property are the constant multiples of the function 

f( z L,h-j, zl, zb .h-j) = J • J ■ (3.20) 

ZL z B,h-j 

However, recalling that zl > e, < ZL,h-j < z B,h-j and ZL,h-j < z l in -D (e), we have that the 
partial derivatives of f(zL,h-j>ZL,ZB,h-j) with respect to ZL,h-j, Zl and ZB,h-j are all 0(l/e), 
from which it follows that the derivative functions are Lipschitz in D (e). 

Let z denote the initial condition vector given by ( I3.13P and (13. 14ft : x = 0, z L = 
fx(l - fk{p-)), z B = p., z HV = 1 - exp(-/x) E?=o^A ! > z L,h-j = z B ,h-j = for all 1 < 
j < w — 1. Note that zq lies on the boundary of both Dq and Do(e). Define D := 
{(x, z L>h - w+ i, . . . , z Lth - 1} ZB,h-w+i, • • • , z B ,h-h zl, z b , z H v) ■ z L > 0, z B - z L > 0, z B - z L > 
(k + 2)z B v}, and let D(e) be the domain obtained by restricting points in D to those with 
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z L > e and z B — z L > e. Thus D(e) is the corresponding extension of D Q (e). Clearly z is an 
interior point in D and D(e). To extend the derivative functions to D, it is enough to extend 
the function / in (13.201) . Define 

f(z L ,h-j, ZB,h-j, Zl) if < Z L ,h-j < Z B ,h-j, ZB,h-j > 

f . \ ) ^ ^ ^L,h—j = ZB,h—j = 0, /q qi\ 

J [ZL,h-j,ZB,h-j,Z L - \ I -r ^ n (d.21J 

1 Z B)h _jjZ L It Zl^-j > Z B ,h-3 > U 

/(kL,h-j|, ks^-il, «l) otherwise. 

We have already shown that / is Lipschitz continuous on .Do(e), which is the first case of (I3.2ip . 
Since z L > e and z B — Zl > e in -D(e), /* is Lipschitz continuous on D(e). Hence, if we modify 
the differential equation system (I3.4j) - (l3.14j) by replacing each expression equivalent to / by 
/*, we obtain derivative functions that are Lipschitz continuous in the open domain D(e). 
Thus, hypothesis (c) holds for this system, which we call the extended differential equation 
system. 

We may now apply Theorem 13 A\ to deduce that a.a.s. uniformly for every < t < T, L t = 
nzi(t/n) + o(n), and the same applies to all the other random variables under consideration. 
We claim that the stopping time T coincides with the time at which L t /n or (B t — L t )/n 
decreases to e. This follows by the following two observations, whose verifications are only 
sketched here since they require straightforward analysis. (See [151 PP- 86,87] for details.) 

(i) The solution of the extended differential equation system is interior to D (e) for all 
sufficiently small x > 0. For instance, all functions taking the value at x = have posi- 
tive derivatives for sufficiently small x > 0. Thus, these functions become positive for any 
sufficiently small x and thus the solution is inside Z?o(e).) 

(ii) Once the solution is interior to -Do(e), the only boundaries of -Do(e) it can reach are 
zl = e, z B — zl = e, zhv = and (z B — zl)/zhv = k + 2. The other boundaries of this 
domain are z L , h -j = 0, z L , h -j = z B ,h-j (i-e. z Hth _j = 0), and z L ,h-j = z L for any j > 0. For 
example, it cannot reach ZL,h-j — because the only negative contribution to the derivative 
of ZL,h-j is proportional to ZL,h-j itself. In view of this, ZL,h-j < zl for any < j < w — 1 
and the last-listed boundary cannot be reached. 

Let x(e) be the smallest value of x such that Zl(x) = e, z B (x) — Zl(x) = e, zhv( x ) — or 
z B (x) — zl{x) — (k + 2)z HV (x), i.e., /x = k + 2, considering the definition fl3.16p . Then the 
solution to the extended differential equation system for all < x < x(e) is also the solution 
to the original differential equation system. Let x* be the smallest real number such that 
zl(x*) = 0, z B (x*) — zl(x*) = 0, Zhv = or \i = (k + 2). Then the solution of the original 
differential equation system can be extended arbitrarily close to x*. 

By the theorem's hypothesis, p, > ck for some c > 1. We next show that for sufficiently 
large k (depending on the value of c), the function z L (x) reaches before z B (x) — Zl(x) or 
zhv( x ) reach or \x reaches k + 2, and we also provide an upper bound of the value of x*. 
Let z H (x) = z B (x) - z L {x). Clearly z L {x) = J27=o z L,h-j(%) and z H (x) = Y17=o z H ,h-j(x). 
So (13.61) . (13.71) and (13.81) immediately lead to 

z' L (x)<-l + hk{k + 1)ZA , z' H >-h- hk{k + 1)z \ z' HV (x)>- h{k + 1)ZA . (3.22) 

zh z h z h 
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Let 5 = (1 — fk(p,))p>. Then the initial conditions give Zl(0) = 5 and z H (0) — ft — 6. Since 
ft > ck for some c > 1, 5 = exp(—Q c (k)). By Proposition 13.2} we may assume that as 
long as n(x) > c'k for some d > 1 and k sufficiently large, A(x), is well defined by (13.1 7} , 
and \[i{x) — X(x)\ < 1, which implies that za{x)/ zh(x) = exp(—Q c >(k)) by (13.191) . We next 
observe that /i(0) > A(0) = ft by (I3.14p and Proposition 13.21 Let [0, Xq] be an interval such 
that > p — Ah for all < x < Xq. Certainly /i(x) > c'k for some c' > 1 for all 

< x < Xq. We may choose k sufficiently large (depending only on the value of c') that 
5 < 1 and for all < x < x we have \X(x) — fi(x)\ < 1, hk{k + 1)za(x)/z h (x) < 1/2 and 
h{k + 1)za(x)/ zh(x) < 1/8. Then for all < x < xo 

z' L {x) < -1/2, > z' H {x) >-h- 1/2, z' HV {x) > -1/8. (3.23) 

Note that znvify — 1) an< i z 'hv( x ) < from (13. 8p . Thus fi(x) = zh(x)/zhv( x ) > zh{x) 
for any < x < x*. Hence, provided Zh(x) > fi — 4:h, we have /i(x) > p — 4h and so the 
inequalities (13.231) hold. Then zh(x) > -2_ff(0) + x (— A — |) = /I — 5 + a;(— /i — |) > p — Ah 
provided x < 35 say, since 5 is arbitrarily small for large k. It follows that n(x) > zh{x) > 
p — Ah for x < min{a;*, 35}. Thus we may choose xq > min{a;*, 35}, and so f!3.23j) implies, for 
any < x < min{x*, 35}, that 

35 x 
z H v{x) > z HV (0) - — > 0, z L (x) <S--. (3.24) 

So x* < 35 and z^n/^*) > 0, since otherwise 35 < x* and zl(35) < 5 — 35/2 < 0, contradicting 
the definition of x*. Combining this with /i(x) > zh{x) > p — 4/i, which is greater than k + 2 
for sufficiently large we conclude that zl(x) reaches before z H (x) or z HV (x) reaches 
and before fi(x) reaches k + 2 (in fact, before fi(x) reaches p — Ah) and x* < 35. We also have 
that z' L (x) < —1/2 for all x < x*. 

For notational convenience, define the following limits from below (which we know to exist 
from the above bounds on the functions and their derivatives): 

ZH,h-j(x*) '■= lim zn,h-j(x) and zhv(x*) := lim zhv(x). (3.25) 

Note that this definition yields continuous functions zn,h-j(x) and zhv(x) on the closed 
interval [0, x*]. 

Given any sufficiently small e > 0, let x(e) be the root of z L {x) = e and let t(e) = [^(e)^J • 
Let Y t denote any of the random variables H t ^_j or HV t , and y(x) its associated real function. 
We have shown that a.a.s. 

Y\_xn\ = ny{x) + o(n) (3.26) 

for < x < x(e). Also, we have \Y\ xn ^ — Y^)^ | = 0((x — x(e))n) for all x(e) < x < x* since 
the change of each variable in every step is bounded by 0(1). Let <5i(e) denote the number 
of light balls remaining at step t(e). Then 5i(e) = en + o(n). Applying Lemma 13.31 with 
Xq = L t(e ), X n = L t{e)+iSl (e), n = A5i(e), 5 = -1/2 and c = h, we have a.a.s. L t{e)+iSl (e) < 
5i(e) — (A5i(e) /2) /2 = 0. Hence, the time r that the RanCore algorithm terminates a.a.s. 
satisfies r < t(e) + 4<5i(e). If it terminates before fix* , we may artificially let it run to that 
point, with the variables remaining static, thereby defining them on the interval t < [x*n\. 
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Then, letting e — y shows that the conclusion (13. 26j) above applies for < x < x*, noting 
that the function y(x) is continuous on [0,x*] as noted below (I3.25p . We may also conclude, 
since <5i(e) — > as e — >■ 0, that 

r = x*h + o{n) a.a.s. (3.27) 
In particular, we conclude that a.a.s. Hy xn ^ h _j = hz Hh _j(x)+o{n) and HV\xn\ = hzHv(x*) + 
o(n). Since zhv(%*) > as shown above, a.a.s. H has a non-empty (w, k + l)-core H. Recall 
that n and rrih-j denote the number of vertices and hyperedges of size h—j in H. Then a.a.s. 
the number of vertices in H is hZHv{%*) + o(n), and the number of hyperedges of size h — j in 
H is nzH,h-j(x*) I {h — j) + o{n). Since z HV (x*) > and ZH,h-j(x*) > 0, we have a.a.s. n ~ an 
and rrih-j ~ fih-j n i where a = zhv( x *) an d fih-j = ZH,h-j(%*) I (h — j)- 

This proves the assertions about H e M.n,m,h- Lemma [37T1 transfers them to Qn,m,h- I 
There are several useful results that we will now derive recalling various pieces of the proof 
of Theorem 12.21 As noted at the start of that proof, the partition-allocation g T output by the 
RanCore algorithm, if it is nonempty, is distributed as V(V T , M T , 0, k + 1) conditional on V T 
and M r . Let n denote \V T \ and rrih-j denote \M T ^-j\/(h — j) for all < j < w — 1. Without 
loss of generality, by relabeling elements in V T and M T in a canonical way, we can simplify 
the notation V(V T , M T , 0, k + 1) to V([n], M, 0, k + 1), where M = (M h _ w+l} . . . , M h ) and 
Mj = [mi] x [z]. The space M, 0, k + 1) is used in the proof of Theorem I2.6l in Section 

Lemma 3.5 Assume C\k < hm/n < 02k for some constants c%> C\> \. Let H be a random 
multihypergraph in M.n,m,h- Then, provided k is sufficiently large, a.a.s. H has a nonempty 
(w, k + l)-core with average degree O(k). 

Proof. Let /2 = hrh/n. Since ft > c\k for some c\ > 1, the existence of a non-empty (w, k+1)- 
core has been shown in Theorem 13 .41 Let x* be as defined in the statement of Theorem 12 .21 and 
let 5 = L /n. We have shown that 5 = 0(e~ n ^) below < KT2\\ and x* < 35 below flHZSD. Let 
zb(x) and zhv ( x ) De defined the same as those functions in (I3.4p - fl3.14p for < x < x*. Then 
clearly zb(x*) < zb(0) since z' B [x) < —1 for all < x < x*. We also have z' HV (x) > —1/8 
for all < x < x* when k is large enough, as shown in the argument below fl3.22p . So 
z H v(x*) > z HV (0)-x*/8 for sufficiently large k. Since z HV (0) = fk+i(ft) = 1 -0(e^ (fc) ) and 
x* = 0(e^ (fc) ), wehave z HV (x*) = l-0(e- n ^). Recall that n (x) = (z B (x)-z L {x))/z H v(x). 
Recall also that z' B (x) — z' L {x) < by the argument below (13. 22 p . Thus, we have /i(0) = 0{k) 
since hrh/n < C2& and 

(x , .jM - >M = °><f» - (1 + (e-^)) = 0(k). 

zhv{x) zhv{0) 

By Theorem 12.21 the average degree of the (w,k + l)-core of H is asymptotically n(x*), which 
is bounded by 0{k). I 

The following lemma gives a lower bound on the size of the (w, k + l)-core of a random 
h- mult ihyp er graph . 

Lemma 3.6 Assume c\k < hfh/h < c^k for some constants C2 > c\ > 1. Let H be a random 
multihypergraph in Ain,m,h- Then a.a.s. the number of vertices in the (w, k + l)-core of H is 
(1 - 0(e- n ^))n. 
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Proof. Let n denote the number of vertices in the (w, k + l)-core of H. We showed just 
after (13.241) that x* < 36 where we had 6 = exp(— Q c (k)). Since in each step at most h heavy 
bins can disappear, the result follows from (13.271) . I 

We need the following lemma before proving Theorem 12.31 

Lemma 3.7 Assume C\k < hfh/n < c 2 k for some constants c 2 > C\ > 1. Let e > be 
fixed. Let Hi be a random multihypergraph in M.n,m,h and H 2 G Mn,m+en,h- Let ni and n 2 be 
the number of vertices in the (w,k + l)-core of Hi and H 2 respectively. Then a.a.s. we have 
\n\ — n 2 \ = 0(e~ n ( k hn). 

Proof. Since c\k < hrh/n < c 2 k, by Lemma |3.5[ the (w, k + l)-core H\ of H\ exists and 
the average degree of H\ is O(k). Let H 2 be a random uniform multihypergraph obtained 
from Hi U £, where £ is a set of en hyperedges, each of which is a multiset of h vertices, 
each of which u.a.r. chosen from [n]. Then H 2 G M.n,m+en,h- We say that the hyperedges 
in £ are marked, and the other hyperedges in H 2 are unmarked. Define a random process 
(# t {1) ,# t (2) ) t > as follows. 

(i) The process starts with (Hq 1 ^, H^) = (Hi,H 2 ). 

(ii) The RanCore algorithm is applied to for every t > 0. The process (H^\ H^) t >o 
stops when the RanCore algorithm running on )<>o terminates. 

i 2 \ 

(iii) For every t > 0, if a marked hyperedge x in H^_\ is updated to x', then x remains 
marked in H^ and if t is defined as H^ x \ if a marked hyperedge x is removed, also 

let = 

(iv) For every t > 0, if an unmarked hyperedge x in H^ is updated or removed, do the 
same operation to x in H^} x and define if t to be the resulting hypergraph. 

We call the random process (H^) t > for i = 1,2 generated by (i/f 3 , Hf the Hi- 
process. Note that the ifi-process is not equivalent to running the RanCore algorithm on Hi, 
since the light balls are not chosen u.a.r. in each step. 

Instead of analysing (H^\ H^) t >o directly, we consider (<% , g^)t>o, the corresponding 
process obtained by considering the pairing-allocation model. Recall that Hi can be repre- 
sented as dropping hfh unmarked balls u.a.r. into n bins with balls evenly partitioned into rh 
groups randomly and H 2 can be represented as dropping hen partitioned marked balls into 
H\. The partition- allocation for % — 1, 2 is obtained by putting all balls contained in light 
bins of Hi into one light bin. Define Lf\ HV} 1 \ and L^, etc., for i = 1, 2 and for t > 0, 
the same way as in the proof of Theorem 12. 2 [ for the ifj-process. Conditional on Lq\ V$\ 
M_f and hf, g® is distributed as V(V (i) , M ( j ] , 1$ , k + 1) for i = 1, 2 and all balls in gfi are 
unmarked. 

Let Jx denote the average degree of Hi and let r be the time the i?2-P r ocess terminates. It 
is easy to show that g^ is distributed as V{Vr , M[ , lS , k + 1) conditional on the values 
of V T {1) , M ( T 1] and since whenever a light ball is chosen, even not uniformly at random, 
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it results in recolouring or removal of heavy balls that are uniformly chosen at random. We 
will later let the RanCore algorithm be run on in the following steps and apply the d.e. 
method to analyse the asymptotic behavior of this process. 

First we show that r = 0(e~ k n). The solution of the differential equation system (13. 4p - 
(I3.14p tells the asymptotic value of L\ in every step t. Let x%\ be the smallest root of 

zf\x) = 0. Since z ( p(0) = 0{e- n ^) and by the argument below (I3T22D . zf\x) < -1/2 for 
all < x < x%-< provided k sufficiently large, we have x%\ = 0(e^ n ^) and so r = 0(e~ n ( k ^n). 
Next we show that n 2 — n\ = 0(e~ n ^en), assuming the following three statements. 

(51) The number of balls that are unmarked and light in but not in is bounded by 
0(e- n «en). 

(52) The number of bins that begin heavy in the ifi-process and become light in that process 
but remain heavy in the ^-process up to step r is 0(e~ Q ^en). 

(53) L { T 1] = 0(e- n ^en). 

Run the Rancore algorithm on g^\ The differential equation system (I3.4I) - (I3.14I) tells 
the asymptotic values of the various random variables in for all t > r. Let x*^ be the 
smallest positive root of z^\x) = 0. Since (r/n) = 0(e~ k e) by (S3) and by the argument 
below (13.221) . z'^\x) < —1/2 for all r/n < x < x*Ls provided k sufficiently large, we have 
x^s — r/n = 0(e~ k e). We also have —1/8 < z' HV (x) < for sufficiently large k for all 

r/n < x < xt^ as explained in Lemma [3751 So HV^ — n\ — 0(e~ k en). Since n 2 — HVr (l) 
counts the number of bins that are, or become light in the ifi-process but stay heavy in the 
.f/2-process, it follows from (SI) and (S2) that n2~HVr V) = 0(e~ k en). So \ni— n 2 \ = 0(e~ k en). 

It only remains to prove (S1)-(S3). We first show that (S3) follows directly from (SI) 
and (S2). ZJp counts two types of light balls. The first type comes from balls that are 
unmarked and light in g$ but not in g^K By (SI), the number of these balls is a.a.s. 
0(e~ n ^en). The second type comes from balls that begin heavy and become light in the 
ifi-process but stay heavy in the ^-process. By (S2), the number of these balls is a.a.s. 
k ■ 0(e- n ^en) = 0(e- n Wen). Thereby (S3) follows. 

Next we show (SI). At step 0, clearly the set of unmarked light balls in gjj is a subset of 
those in g^ . The number of light balls in g^ is a.a.s. (1 — fk(jl))jln = 0(e~ n ^n) as shown 
in the proof of Theorem 12.21 and hence the number of light vertices of Hi is a.a.s. 0(e -t2 "n). 
Since each multihyperedges in S is a random multihyperedges, the expected number of those 
which contains a light vertex in Hi is 0(e~ n ^en), hence the number of light vertex in Hi 
that become heavy after the hyperedges in £ being dropped is a.a.s. 0(e~ n ^en) and each of 
these vertex/bin contains at most k unmarked balls. Thus (SI) follows. 

Now we show (S2). Recall that Hi is represented as dropping hfh unmarked balls u.a.r. 
into n bins and H 2 is obtained by dropping hen extra marked balls u.a.r. into the n bins in 
Hi. Recall that Hi denotes the (w, k + l)-core of Hi. The number of bins that begin heavy 
in the i^i-process and become light in that process but remain heavy in the i^-process up 
to step t is at most the number of bins/vertices not in Hi which receive at least one marked 
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balls after dropping hen marked balls u.a.r. into the n bins. By Lemma I3.6[ the number of 
vertices/bins in Hi is a.a.s. (1 — 0(e^ n ^))n. Then for each marked ball, the probability that 
it is dropped into a bin not in Hi is 0(e~ n ^). By Lemma [3.3[ the number of marked balls 
dropped into bins not in Hi is a.a.s. 0(e~ n ^en). Hence the number of bins that are not in 
Hi and receive at least one marked balls is a.a.s. 0(e~ n ^en). I 

Proof of Theorem 12.31 Let Hi be a random uniform mult ihyper graph with average degree 
/2 and let H 2 be a random uniform multihypergraph obtained from Hi U 8, where 8 is a set 
of en hyperedges, each of which is a multiset of h vertices, each of which is uniformly chosen 
from [n]. 

For i = 1,2, let Hi be the (w, k + l)-core of Hi and let ni^_ - be the number of hyperedges 
with size h — j in Hi. We first show that 

ID— 1 W — 1 

J^O - j)m i l ) _ j - ^(w - fim^Lj > wen/2. 

3=0 3=0 

Clearly H\ is a subgraph of H 2 . Let n\ denote the number of vertices in Hi and let [ni\ 
denote the set of vertices in Hi. By Lemma [3.6} a.a.s. n\ — (1 — 0(e~ n( - k ^))n. Then for any 
hyperedge x G 8, the probability that all vertices in x are contained in [ni] is l — 0(e~~ n ^). So 
the expected number of hyperedges in 8 lying completely in [m] is (1 — 0(e~ n ( k ^))en. By the 
Chernoff bound, originally given in [7J Theorem 1], we have a.a.s. the number of hyperedges 
in 8 lying completely in [ni] is at least en/ 2 for sufficiently large k. So it follows immediately 
that a.a.s., 

w — 1 w— 1 

i=o j=o 

For simplicity, let S'(i) denote Yuj=o( w ~ j) m h-j f° r ^ = 1,2. Recall that ft(-Hj) denotes 
S(i)/ni. Then a.a.s., 

^ ^ (g(i) + W 6n/2)-g(i)-n 2 /n 1 

n 2 ni n 2 

By Lemma 13.71 a.a.s. n 2 — ni = 0(e~ n ^)en, i.e. n 2 /ni — 1 < f(k)e for some function 
f (k) = 0(e" n W). Then a.a.s., 

wen/2 - 0(f(k)eS(l)) 
k{H 2 ) - > - — — > we/A > 0, (3.28) 

for sufficiently large k and for every e > 0, since 5(1) = 0{k)n and n 2 — (1 — 0(e~ n ( fc )))n. 

By Theorem I2.2[ for given /i > u> > and sufficiently large k, a.a.s. k{H) = c(p) + o(l), 
where c(/2) is a constant depending only on ft. The inequality ( I3.28|) implies that c(fl) is an 
increasing function of Jx. I 

Proof of Corollary 12.41 By Theorem 12.31 there exists a unique critical value of /2 such 
that a.a.s. k(H) = k + o(l) and so there exists a threshold function rh = f(n) of A4n,m,h for 
the graph property T. Then this holds as well in Qn,m,h by Lemma I3TT1 I 



23 



The differential equations in Theorem 12.21 are only used in a theoretical way to show 
properties of the (w, k + l)-core, and we do not have an analytic solution. However, they 
can be numerically solved when the values of h, w, k and fi are given. Table [3] gives the 
results of some computations, where h, w and k are given, jl denotes the expected average 
degree of the hypergraph H at the threshold for T given in Corollary I2.4[ and jl denotes the 
corresponding average degree of its core H. Even though our results on the concentration 
of the size and density of the (w, k + l)-core and the threshold of property T only cover for 
the case of sufficiently large k, our numerical computation results as shown in the table do 
coincide with our simulation results. Hence we believe that Theorem 12.2(12.31 and Corollary 12. 41 
actually hold for all k > 1. By Corollary 12 .7\ discussed in the next section, jl is also our main 
target, the threshold for orientability. Note that jl must be at least hk/w by the definition 
of property T, and that it follows from the trivial upper bound of the orientability threshold 
given in the introduction part that jl is at most hk/w. 



h 


w 


k 


jl 




3 


2 


4 


5.485 


6.65086 


3 


2 


10 


14.766 


15.5872 


3 


2 


40 


59.991 


60.0773 


10 


2 


4 


19.99999 


20.0003 



Table 1: Some numerical computation results 



4 The (w, /c)-orientability of the (w, k + l)-core 

In this section we prove Corollary 12.71 assuming Theorem 12.61 and study the basic network 
flow formulation of the problem that is used in the next section to prove Theorem 12.61 

The following lemma is in preparation for proving that H, the (w, k + l)-core of H G 
Mn,m,h, if not empty, a.a.s. has property .4.(7) for some < 7 < 1. 

Lemma 4.1 Let H G M.n,m,h <^nd let H be the (w, k + l)-core of H . Let C\ > 1 be a constant 
that can depend on k. Then there exists a constant < 7 = ip(k,ci) depending only on k 
and c\, such that a.a.s. there exists no S C V(H) with \S\ < ■yn and at least c\\S\ hyperedges 
partially contained in S. More specifically, when c\ > 2 and c x < h 2 e 2 ji, we may choose 
7 = <f(k, c x ) = (ci/ZiV/i) 2 : . 

Proof. Let s be any integer such that < s < n and let r = s/n. Let Y denote the number 
of 5* with l^l = s and at least cis hyperedges partially contained in S. The probability for 
a given hyperedge to be partially contained in S is at most {^(s/n) 2 < h 2 r 2 . Then the 
probability that there are at least C\S such hyperedges is at most 

(: s )"»') 2c ** 
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Since there are ( n ) ways to choose S, 

* E (f)'©"*^ E 

\nn<s<'yn l<s<lnn 



lnn<s<7n N \ / / l<s<lnn 

< ^ (Cr Cl V C1 ) + mn.E__ 

lnn<s<7n 
In n<s<7« 

for some constant < C = C(c\) < (h 2 /ci) Cl e Cl+1 . Choose 



CI -1 



7<1 /^' e C1_1 - 

Then C^ ci ~ l p, ci < 1. So there exist < /3 < 1, such that Cj 01 ' 1 ^ 01 < /?, for all r < 7. When 
ci > 2 and c\jh 2 e 2 Ji < 1, 

\ -^t / \ ci/(ci-l) / \ 2 

e -I" 1 > — E- > 



h 2 e[i J \h 2 e 2 fi J \h 2 e 2 fj, 

Hence we may simply choose 

7 = ( Cl /h 2 e 2 fi) 2 . Then 
{Cr 01 - 1 ^ 1 ) 3 < P* = 0{(3 lan ) = o(l). 

lnn<s<7fi lnn<s<7n 

Hence we have E(Y") = o(l). I 

The following corollary shows that the same property is shared by H. 

Corollary 4.2 Let H G A4n,m,h and let H be the (w, k+l)-core of H. Let c\ be a constant that 
can depend on k, with the constraint that 2 < c\ < h 2 e 2 fl. Let < 7 = (p(k, ci) = (ci/h 2 e 2 jl) . 
Then a.a.s. for all S C V(H) with \S\ < 772, the number of hyperedges partially contained in 
S is less than Ci\S\. 

Proof. Let n be the number of vertices in H and D the sum of degrees of vertices in H. 
For any hyperedge x G H, let x + denote its corresponding hyperedge in H . Obviously n < n. 
Combining with Lemma [4.11 and the fact that for any S C V(H), a hyperedge x is partially 
contained in S only if x + is partially contained S in H, Corollary 14.21 follows. I 

We next show that H, if not empty, a.a.s. has property .4.(7), defined in Definition 12.51 
for some certain value of 7. 
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Corollary 4.3 Assume that H G M.n,m,h with m < hk/w, and that H is the (w, k + l)-core 
of H . Let 7 = e _4 /i~ 6 /4. Then provided k > 4w, a.a.s. either H is empty or H has property 

Art). 

Proof. Apply Lemma [4721 with c\ = k/2w. Clearly c\ < ch 2 e 2 k, and c\ > 2 provided k > 4w. 
Then 7 < <p(k,ci). By Definition I2.5[ H a.a.s. has property -4(7). I 

Proof of Corollary 12.71 Let H be the {k + l)-core of the random multihypergraph H G 
M-n,m,h- Let e > be any constant. By Theorem I2.3[ there exists a constant 5 > 0, such 
that a.a.s. if rh < f{fn) — era, then X^=To ( w ~~ j) m h-j < kn — 5n. By Theorem 12.61 and 
Corollary 14.31 there exists a constant N depending only on h and w such that provided 
k > N, H a.a.s. has a (w, /c)-orientation. On the other hand, if rh > f(rh) + en, then a.a.s. 
Y^j=o( w ~ j) m h-j >kn + 5n, and hence clearly H is not (w, A;)-orientable. Therefore f(h) is 
a sharp threshold function for the (w, fc)-orientation of M-n,m,h- By Lemma [3.11 /(ra) is also 
a sharp threshold function for the (u>, /c)-orientation of Qn,m,h- I 

Let G be a non-uniform multihypergraph with the sizes of hyperedges between h — w + 1 
to h. In the rest of the chapter, we will use the following notations. Let Eh-j := {16 E(G) : 
M = h — j}. For any given S C [n], let rrih~j t i(S) := \{x G -E^-j : \x fl = i}| for any 
< i < h — j . When the context is clear of which set 5* is referred to, we may drop S from 
the notation. Let S denote the set [ra] \ S and let d(S) denote the sum of degrees of vertices 
in S. 

Recall from above the statement of Corollary 12.81 in Section [2] that for any S C V(G), G$ 
denotes the subgraph ^-induced by S. The following Lemma generalises Hakimi's theorem [I~7| 
Theorem 4] for graphs. It is proved using network flow and the max-flow min-cut theorem, 
along the lines of the standard techniques discussed in [9j [26] . This setting was used before 
in connection with the load balancing problem in [251 Section 3.3]. 

Lemma 4.4 A multihypergraph G with sizes of hyperedges between h — w + 1 and h has a 
(w, k) -orientation if and only if k(G$) < k for all S C V(G). 

Proof. Formulate a network flow problem on a network G* as follows. Let L be a set of 
vertices, each of which represents a hyperedge of G, and R be a set of ra, vertices, each of which 
represents a vertex in G. For any u G L, and v G R, uv is an edge in G* if and only if v G u 
in G. Add vertices a and b to G*, such that a is linked to every vertex in L, and b is linked to 
every vertex in R. Let c : E(G*) — > N + be defined as c(au) = w — j for every u G L such that 
the degree of u is h — j, c(vb) = k for every v G R, and c(uv) = 1 for every uv G E{G*). Then 
G has a (w, fc)-orientation if and only if G* has a flow of size Y^j=o ( w ~~ j) m h-j from a to b. 
By the max-flow min-cut Theorem, G* has a flow with all edges incident with a saturated if 
and only if 

w— 1 

c(S(C)) > y^(w - j)m h -j, for all (a,b)-cuts C. (4.1) 

j=0 

As an example in Figure [TJ A C L is a set of hyperedges in G, and S C R is a set of 
vertices in G. Let C = {a}UAuS define a cut of G*. Then the condition in (14. ip is equivalent 
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Figure 1: A cut C in the graph G* 



to 

w—1 I \ w—1 

vc, c(5(c)) = k\s\ + J2[ E E kn^| >E(^-iW-i> ( 4 - 2 ) 

i=o yxeAn£; h _ J les^.^A / i=o 

Let A* := {x G : |rrn *S"| < /i — w}. Clearly A* minimizes c(S(C)) for a given 5. Therefore 
we only need to check (14. 2 j) when A = A*. The condition in (14. 2 p is then equivalent to 

E( E («»-.*)- E ^ n ^i) 

i=o \xe-E fe _j\A xeE h _ s \A J 

For any hypergraph G, let /3(G) denote the number of hyperedges in G. Recall from the 
statement above Theorem 12.31 that \S\k(Gs) = d(G) — (h — w){3(G). Since 

w—1 I \ w—1 

E E ( w ~i)- E kn^|UE E {w-j)-(h-j-\xns\) 

j=0 \xeEk_j\A xeE h _j\A J j=0 x€E h _ 3 \A 

= ]T \x n S\ - E(^ ~w) = d{G s ) -{h- w)/3(G s ) = \S\k{G s ), (4.3) 

xftA x£A 

Lemma 14.41 follows. I 

The next corollary follows immediately. 

Corollary 4.5 A hypergraph H in Qn,m,h has a (w, k)- orientation if and only if for every 
S C V(H), k(H s ) <k. 

Proof of Corollary 12.81 This follows directly from Corollary 12.71 and Corollary 14.51 I 



27 



For any vertex set S, define 

w—1 h—j 

d*(S) = d(S)-J2 E (4-4) 

j=0 i=w—j+l 

which measures a type of expansion in the hypergraph. For each hyperedge x of size h — j 
which intersects S with i vertices, its contribution to d* (S) is w — j > if i > w — j + 1 
and i > otherwise. Therefore d*(S) > for any S. The following lemma characterises the 
existence of the (w, fc)-orientation of G in terms of d*(S). 

Lemma 4.6 Let G be a multihypergraph whose hyperedges all have sizes between h — w + 1 
and h inclusively. Then the following two properties of G are equivalent: 

(i) k(G s ) < k, for all S C V{G); 

(ii) d*(S) > k\S\ + (zrJoV - j)m h - ) - kn, for all S C V(G). 

Proof. Let /3(Gs) denote the number of hyperedges in Gs- We show that for any S C V(G), 

k(G s )\S\ = d(G s )-(h-w)(3(G s ) < k\S\ if and only if d* (S) > k\S\+ (^2J~ \w-j)m h ^ - 

kn. Then Lemma [4.61 follows immediately. Note from the definition of A*, we have for any 
x G E h _j \ A*, \x C\ S\ > h — w + 1 and hence \x fl S\ < (h — j) — (h — w + 1) = w — j — 1. 
By gi, for any S C V{G), 

d{G s ) - (h - w)P{G s ) < k\S\ 

E ( w ~J)- E \% r)S\\ < kn - k\S\ 

j=0 \xeEh_j\A* x£E h _j\A* J 

w — 1 w—j — 1 

^ E E ( w " j ~ *) m h- h i{S) <kn- k\S\ 

j=0 i=0 

w — 1 w—1 / h—j w—j — 1 \ 

E( w ~ j) m h-j ~ E ( E ( w ~ i) m h~jA s ) + E im h-j,i{S) \ <kn-k\S\ 

j=0 j=0 \i=w—j i=0 / 

d*(S) > k\S\ + \ Y,( W ~ j) m h-jj ~ kn. I 

It follows from Lemma 14.41 and Lemma 14.61 that G is (w, /c)-orientable if and only if 
Lemma 14.61 (ii) holds. 

Without loss of generality, we assume YlJ^o ( w ~ 3) m h-j — kn < 0. Otherwise, condi- 
tion (14. II) is violated by taking C = {a} U L U R. The following lemma shows that, instead of 
checking conditions in Lemma [4.61 (ii). we can check that certain other events do not occur. 
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For any S C V(G), let 

h—j w — l h—j — 1 

Ih-jiS) = ^2im h _ jti , T](S) = ^2 m h-i,i- ( 4 - 5 ) 

1=1 j=0 t=l 

In other words, qh-j(S) denotes the contribution to d(S) from hyperedges of size h — j and 
fj{S) denotes the number of hyperedges which intersect both S and S. When the context is 
clear, we may use q^-j and rj instead to simplify the notation. 

Recall that given a vertex set S, a hyperedge x is partially contained in S if \x fl S\ > 2. 
Let p(S) denote the number of hyperedges partially contained in S and let ^(S 1 ) denote the 
number of hyperedges intersecting S. 

Lemma 4.7 Suppose that for some S C V(G), 

d*(S) < k\S\ + \ J2(w - j)m h ^j - kn. (4.6) 

Then all of the following hold: 

(i) p(S) > k\S\/w; 

(ii) v{S) < k\S\; 

(m) (h-w)p(S) > d{S)-k\S\; 

w — l 

(iv) if, in addition, — — -qh-j(S) > (1 — 5)A;|S'| for some 5 > 0, then rj(S) < h 2 5k\S\. 
j=o ~ 3 

Proof. Let s and s denote \S\ and \S\ respectively. If (14. 6B is satisfied, then 

w— 1 h—j / w—l \ w—l 

^2 ( i ~( w ~j)) m h-j,i<ks-lkn-^(w-j)m h - j \ = ^(w - j)m h -j - ks. 

3=0 i=w-j+l \ j=0 J j=0 



Hence 



w—l w—l h—j w—l h—j 

ks < ^2(w - j)m h ^j - im h -j,i + ^2 - ( w - j)) m h-j,i 

j=0 j=0 i=l i=0 i=iu— j+l 

tu— 1 iu— In)- 1— j 

= ^2( W - j) m h-j,0 + Y^ ( W ~ j ~~ l ) m h-j,i 

j=0 j=0 i=l 

w — l / w—l—j 

< w ^ I mfe-^o + Y2 mh ~iy 

j=0 \ i=l 
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Since 

™>h-j,o =\{x e E h _j : \xV\S\ = h- j}\, 

and 

w-l- j 

m h-i,i ^ w \i x e E h-j :2<\xnS\<h-j- 1}\, 

i=i 

(this is because 1 < i < w — 1— j and so h—j — i < h — j — 1 and h—j — i > h — (w — l) > 2) , 
we have 

ks < w\{x E E(G) :\xnS\ > 2}|. 
This proves part (i). Again, if (14. 6ft is satisfied, then 

w—l h—j / w—l 

^ ^ (i-(w - j))m h -.j ti > d(S) -ks+ [kn - ^(w - j)m h . 

j=0 i= w -j+l \ j=0 

Since 

w—l h—j h 

£ (i-(w-j))m h - jti <^(i-l)\{x:\xnS\ = i}\ = d(S)-v(S), 

j=0 i=w—j+l i=2 

we have 

tw-l 
kn — ^^(w — j)rrih- 
j=0 

Since kn — X}j=o ~ j) m h-j > 0, this directly leads to part (ii). Since 

w—l h—j 

(i-(w-j))m h - j , i <{h-w)\{x:\xnS\>2}\, 

j=0 i=w—j+l 

we have 

(w-1 
kn - - j)m h . 

Since kn — Y^=o ( w ~ j) m h-j > 0, this proves part (iii). Now we prove part (iv). Let 

t h -j = 1 - (h- j)m h -j,h-j/qh-j- Note that d(S) = Y^=q <lh-j and q h ^ = J2i=i im h-j,i- For 
each hyperedge x of size h — j which intersects S with % vertices, its contribution to qt-j (and 
thus to d* (S)) is 

• i ■ (w-j)f(h-j), i£i = h-j; 

• i ■ (w — j)/i > i ■ (w — j)/(h — j — 1), if w — j + 1 < i < h — j — 1; 

• i > i ■ (w — j)/(h — j — 1), if 1 < i < w — j; 
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Then 



d*(s) > E (jzj^iO- - fr-j) + h W -j-i qh -i th ->) 



3=0 

w—1 . w—1 

- i^J""-' + h (h-m- 3 -i)'"'-' tb - 1 



3=0 J 3=0 

w—1 . _ w—1 



3=0 J 3=0 



If Y^j=o j^jQh-j > (1 — S)ks for some 5 > 0, then (14. 6 p implies that 

j to — 1 

12 ~~ j)lh-jth-j < 5ks. 

3=0 

Therefore rj(S) < Y^j=o Qh-jth-j < h 2 5ks. This proves part (iv). I 



5 Proof of Theorem 12.6 



Recall from Section [2] that Ai(n,m, k + 1) is Ai njIa , which is a random multihypergraph 
with given edge sizes, restricted to mult ihyper graphs with minimum degree at least k + 1. 
In this section we prove the only remaining theorem, Theorem 12.61 This theorem relates 
the orientability of A4(n, m, k + 1) to its u>-density. Recall that this probability space was 
important because, by Proposition 12.11 it gives the distribution of the (w, k + l)-core H of 
H G M.n,m,h conditioned on the values of n, the number of vertices and m^-j, the number of 
hyperedges of size h — j for each j, in the core. 

It is clear, that given values of n and m, the probability space of random multihypergraphs 
generated by P([n], M, 0, k + 1), with \Mh-j\ — (h — j)rrih-j (h = 0, . . . , w — 1), is equivalent 
to M.(n, m, k + 1). So we may, and do, make use of the partition-allocation model for proving 
results about Ai(n, m, k + 1). 

For the rest of the chapter, let e > and k > 2 be fixed. Without loss of generality, we may 
assume that e < ~ since e may be taken arbitrarily small. By the hypothesis of Theorem 12.61 
we consider only m such that Xof=o C 10 ~~ j) m h-j < kn — en. We may also assume that 
YlJ=o ( w ~ j) m h-j > kn — 2en since otherwise, by Theorem 12.21 we can simply add a set of 
random hyperedges so that the assumption holds. This is valid because (w, /c)-orientability is 
a decreasing property (i.e. it holds in all subgraphs of G whenever G has the property). Let 

W — 1 W — 1 j-y 

D = y^jh - j)m h _j, m = y2 m h-j, l*> — — ■ (5.1) 

3=0 j=0 U 

Since 

^ w—1 w—1 

D ■ - — < y^(w - 3)m h _j < D ■ — , m < VYw - j)m h -j < win, 

3=0 3=0 



31 



and 

w—l 

kn — 2en < ^^(w — j)mh-j < kn — en, (5.2) 

3=0 

we have 

h(k -l)/w < fi = D/n < (h-w + l)k, — - — < m < (k - -)n. (5.3) 

w V 2/ 

In the rest of the paper, whenever we refer to the probability space H(n, m, k + 1) or 
M.(n, m,k + l), we assume m satisfies (15 .2p . 

Since h and w are given, we consider them as absolute constants. Therefore, whenever 
we refer to g = 0(f), it means that there exists a constant C such that g < Cf, where C 
can depend on h and w. We also use notation g = 7 (/), which means that there exists a 
constant C depending on 7 only such that g < Cf. The same convention applies to o(f), 
n(/), 6(/) and o y (f), n y (f) and 9 7 (/). 

By Proposition 12.11 conditioned on the values of n, the number of vertices and rrih-j, the 
number of hyperedges of size h — j, of the (w, k + l)-core H of H e A^n,m,fc, -H" is distributed 
as A^(n, m, k + 1). Recall that e > and k > 2 are fixed. Let m be an integer vector 
with the constraint f)5.2p . Given m, let D, \x be as defined in (15. ip . Let G be a random 
multihypergraph from the probability space M.(n, m, k + 1). 

We next sketch the proof of Theorem 12.61 Let qh-j(S) and ^(S*) be defined as in (14. 5p . The 
partition-allocation model gives a good foundation for proving that a.a.s. certain properties 
hold concerning the distribution of vertex degrees and intersections of hyperedge sets with 
vertex sets. Using this and various other probabilistic tools, we show that 

(a) the probability that G G M.{n, m, k + 1) has property ^.(7) and contains some set S 
with \S\ < jn for which both Lemma l4.7( ii) and (iii) holds is o(l); 

(b) there exists 5 > 0, such that when k is large enough, a.a.s. Y^j=o ^EjQh-j > (1 — ^^"Sl, 

and the probability of G G Ai(n, m, k + 1) containing some set S with / yn < \S\ < 
(1 - i)n and n{S) < h 2 5k\S\ is o(l). 

We also show the deterministic result that 

(c) no multihypergraph G with property ^.(7) contains any sets S with |,S| > (1 — 7)71 for 

which Lemma f4.7( i) holds. 

It follows that the probability that G has property ^.(7) and contains some set S for which 
all parts (i)-(iv) of Lemma [4. 71 hold is o(l). Then by Lemmas 14.61 and 14.71 

P(G G .4.(7) A G is not (w, A;)-orientable) = o(l). 

Finally, Lemma 13.11 shows that the result applies to random (simple) hypergraphs as well. 

We start with a few concentration properties. As discussed in Section [31 the degree se- 
quence of G G M.(n, m, k + 1) obeys the multinomial distribution. The following lemma 
bounds the probability of rare degree (sub) sequences where the degree distribution is inde- 
pendent truncated Poisson. We will use this result to bound the probability of rare degree 
sequences in M.(n, m, k + 1). 
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Lemma 5.1 Let s > w(n) for some w(n) — > oo as n — > oo and let Yi, . . . , Y s be independent 
copies of Z defined in A3.2\) with X satisfying Xfk(X) = fJ,fk+i(X). Let < 5 < 1 be any 
constant. Then there exist N > and < a < 1 both depending only on 5, such that provided 
k > N, 

Proof. Let G(x) be the probability generating function of Yj. Then 
for all x > 0. For any nonnegative integer £, 



,i=0 



Putting x = £/sX gives 

pfe^^h W(AsM - -^j (5 - 4) 



e^" As fesXV f e~ x 



,i=0 



It is easy to check that the right hand side of (15.41) is an increasing function of I when I < Xs 
and decreasing function of / when I > Xs. By Proposition 13. 2[ there exists a constant iVo 
depending only on S such that provided k > N Q , (1 — S)fi < X. Thus, for any £ < (1 — 



•ME* 

and so 



< 



,i=0 



1-5) fis J \fk+i(X) 



x \ (l-<5Ws / -A \ s 

eX \ i e 



~ S )V/ \fk+i(X) 

The expectation of Y\ is \fk(\)/fk+i(X) = \i. By Proposition 13.21 we have /i > A and 
\l — A — » as &; — > oo. Therefore 



Since < 5 < 1, 



exp(/i — A — 
exp(,u — A) / exp(— 5) x ' 
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Since 

exp(/i — A) — > 1, /fc + i(A) — > 1, as k — > oo 



by Proposition 13.21 there exists N% > and < a% < 1, both depending only on 5, such that 
provided k > Ni, 



p 5><(i-*)H 



,i=i 



Now we bound the upper tail of Y^ =1 Vi. Let j = 1,2,.. .. For any £ satisfying (1 + j)fis < 
£ < (2 + j)fJ>s, as with the lower tail bound, 



and so 



, esA y i+ ^>Y e ~ A V _ / exp(^ - A) 



(1+jW \h+iWJ V A+iW V(i+j)( 1+ f 



p ((i + ^ < g r, < (2 + J>s ) < „. (!2g^> • , , . ,5.5, 

Similarly we have 



P i < 1 + < > ^ < | < //.s ( , x , • • ( {1 + 6)il+ s ) ) ) ■ (5-6) 

Since < e s / (1 + S)^ 1+5 ^ < 1 for any S > 0, we may bound the right side of (15. 5p and (15. 6p by 
a>2 S where < «2 < 1 is some constant depending only on 5. Also, since e/(l + j) < 1 for all 
j > 2, the right side of (15. 5p is at most exp(— Q(fi)js) for j > 2 provided k is large enough. 
Hence there exists N 2 > and < a 3 < 1 depending only on 5, such that provided k > iV 2 , 

p^>>(i+*)a«) <«f- 

The lemma follows by choosing a = max{ai, 03} and N = max{A?i, N2}. I 

Lemma 5.2 Let k > — 1 fre an integer. Drop D balls independently at random into n bins. 
Let fi = D/n and let A be defined as Xfk(\) = fifk+i(X). Assume D — {k + l)n — > 00 as 
n — > 00. Then the probability that each bin contains at least k + 1 fraZZs is f2(/fc+i(A) n ). 

Proof. Let d denote (d x , . . . , d n ). Let 9 = {d : d { > k + 1 Vz G [n], £" =1 = Let 
P(-B) denote the probability that each bin contains at least k + 1 balls. Then 



A\, . . . ,d n ) I n D ^— ' J - J - cL! 
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Let Yx,...,Y n be n independent truncated Poisson variables which are copies of Zr>k+i) a s 
defined in (13. 2p with parameter A satisfying Xfk(X) = fJ,fk+i(X). Then 

Since D - (k + l)n -> oo as n ->• oo, P (£" =1 = £>) = f2(Zy~ 1//2 ) (see [22 Theorem 4(a)] 
for a short proof), 

\-n i _ n ^ Aw WA) ? 

de^j=i v 

So, using Stirling's formula, 

p(b) = fi p.^L n LK.^UAr 



(5.7) 

n^e^ 1 / /*i(A) n ). 
Since (///A) • e^- 1 > 1, P(£?) = n(f k+1 (X) n ). I 

Corollary 5.3 Lei k > —1 be an integer. Let @ — {d : dj > k + 1, Vi G M>$^iLi^i = ^} 
and Ze£ A ra 6e any subset of 3) . Let /i = Zy/n. Let P(A0 denote the 'probability that the 
degree sequence d of G G A4(n, m, + 1) is in A n and let PTp{A n ) be the probability that 
(Y"i, . . . , Y n ) G A n where Yi are independent copies of the random variable Zt>k+i) as defined 
in (EJ|) u>rfn i/ie parameter A satisfying Xfk(X) = fifk + i(X). Assume D — (k + l)n oo as 
n — > oo. Then 

P(A n ) = O (y/D) P T p(A n ). 

Proof. Let A n be any subset of 3i and let P{B) denote the probability that each bin contains 
at least k + 1 balls by dropping D balls independently and randomly into n bins. Consider 
the partition-allocation model that generates A4(n, m, k + 1), which allocates the partitioned 
D balls randomly into n bins with the restriction that each bin contains at least k+1 balls. 
Then 

P(A,) = £ PfB) ' Gi. ° , J = S-fej J? 



and 



Therefore 



n e~ A A di e~ An A D " 1 



Jj\ p \nfn 

P{An) = n°P{B)X° VTp{An) = ° V 3 ) Ptp( ^ } ' 
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since P(B) = ((^e^- 1 )^" / fc+1 (A) n ) by Lemma E2 flEZD • I 

A significant difficulty in this work is to ensure that various constants do not depend 
on the choice of e. In particular, we emphasize that the constants such as a and N in the 
following results do not depend on e. 

The next is a corollary of Lemma 15.11 and Corollary 15.31 

Corollary 5.4 Let \i be defined as in 115. Let < 5 < 1 be any constant. Then there exist 
two constants N > and < a < 1, both depending only on 5, such that, provided k > N , 
for any vertex set S C V(G) with \S\ > log 2 n, 

P{\d{S) - fi\S\\ > 6fi\S\) < a^. 

Proof. Let Y\, . . . , Y n be independent copies of the truncated Poisson random variable Z as 
defined in (13. 2p . Let S C V(G) and let s = \S\. Then by Lemma [5.1[ there exist N > and 
< a < 1, both depending only on 6, such that provided k > N, 



ies 



> 8jjis ] < a 



By Corollary 15. 3[ 



P{\d{3)-fis\>6fis) < OiD 1 ^ = ( exp ( ln<S> ^) ^ & X " 



/is 



Since s > log 2 n and so 



lnB(y^) 

> U, as n — >■ oo. 



Let a = 1/2 + d/2. Then < a < a < 1 and a depends only on 5. Then provided k > N, 
P(\d(S) -fis\ > 5ns) < a» s . I 

The following corollary shows that d(S) is very concentrated when S is not too small. 

Corollary 5.5 Let 5 > and < 7 < 1 be arbitrary constants. Then there exists a constant 
N depending only on S and 7, such that provided k > N, 

P(3S C V(G), s > 7 n, \d(S) - y.s\ > 5/is) = o(l). 

Proof. For any S C V(G), let s — \S\. By Corollary there exists N ± > and < a < 1, 
both depending only on 5, such that provided k > Ni, for any S C V(Cr), 



/.ts 



P(\d(S) -ns\ > 5fis) < a 

Let N 2 be the smallest integer such that ea N2 /^ < 1/2. Let iV = max{iVi, A^}. Then N 
depends only on 5 and 7. For all fi > N, 
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P(BS c[n},s>jn,\d(S)-fis\>5fis)< ^ ()° r ^ Yl {~ 

r yn<s<n ^ ' -yn<s<n 

< E (~'oA' = 0(2-t»)=o(1). ■ 

i<a<n ^ ' / 



7n<s<7i 

The following lemma will be used later to prove that a.a.s. YlJ=o ( w ~ j)Qh-j/(h — j) > 
(1 — S)ks provided k is large enough. 

Lemma 5.6 Let C = {cq, . . . , c w -i} be a set of colours. Suppose that D balls are each coloured 
with some colour in C, and let pj denote the proportion of balls that are coloured Cj (0 < j < 
w — 1). Randomly choose a subset of q of the balls. Let qj be the number of balls chosen that 
are coloured with Cj. Then for any < j < w — 1 and < 5 < 1, 

P(|?j -PjQ\ > S Pjl) < exp(-n(5 2 pjq)). 
Proof. For any < j < w — 1 any £ > 0, 



^-fl-W( D .-f)/0- 



Let pi denote P(qj = £)■ Put £ = pjq, i\ — (1 — 5/2)pjq and £2 = (1 — 8)PjQ- Then for any 

£<£%, 



Then 



So 



Pt-i £(D(l- Pj )-q + £) < li(D(l- Pj )- g + l ) _ 1 g 
p h < (1 - S/2)^p tl < exp In (l - < exp(-^g/4). 

Pfe < (1 - = E^ - - ex p(-^(<5 2 m))- 



Similarly we can bound the upper tail and then Lemma 15.61 follows. I 

Lemma 5.7 Let < 5 < 1 and < 7 < 1 be two arbitrary constants. Given S C V(G), let 
qh-j = qh~j(S) be as defined in h4.5\ ). Then there exists N > depending only on 5 and 7 
such that for all k > N, 



w—l 



P ( 3S c V(G), \S\ > 7 n, Y^ qh - ] < (1 " = 

V 3=0 J J 

Proof. For any < j < w — 1, let Pj denoted (/i — j)rrih-j/D. Let J := {j : pj > 5/8w}. 
We first show that given S C V(G) with jS] > 772, if 



w— 1 

^^ 9H <(l-tf)fc|S| ) (5i 
j=o J 
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then there exists j G J such that qt-j{S) < (1 — 5/8)pjd(S). Assume there is no such j by 
contradiction. Then 



MJ — 1 

i=o ' j'gj ' ■ ' ' 

to— 1 

10 -J 




to — 1 . to— 1 

> (1 - 6/8)d(S) TZ^PiO- ~ */8) > (1 - */4)d(S) £ S Pi- (5-9) 
i=o J i=o ^ 



Let s = \S\ and let r — s/n. Then by Corollary 15 .5[ there exists N 2 > depending on 5 and 7 
only, such that a.a.s. d(S) > (1 — 5/A)Dr whenever k > N 2 . Therefore, combining with ( 15. 9p . 
we get s. provided k > max{Ni, N 2 }, 

w—l . til— 1 

V ¥—?-qh-i(S) > (1 - 5/2)Dr V ^^ft > (1 - 5/2)r(A;n - 2en) = (1 - 5/2)(fc - 2e)s. 

At — 7 h — 7 

i=o J i=o J 

For any k>2/5> At/ 5, we have (l-£/2)(Jfe-2e)s > (l-£)Jfe s . Take JV = max{iV 1 , JV 2 , 2/5}. 
Then for any k > N, we have a.a.s. 

to— 1 

i=o rt 3 

which contradicts (15. 8p . It follows that there exists j G J such that qh-j(S) < (1 — 8/8)pjd(S). 

Consider the partition-allocation model that generates 7- > ([n], M, 0, k + 1). Let C = 
{co, . . . ,c w -\} be a set of colours. For balls partitioned into parts that are of size h — j 
for some < j < w — 1, colour them with Cj. Then the iy colours are distributed u.a.r. among 
the D balls. By Lemma [5TB| for any S C V(G), 

P(q h ^(S) < (1 - S/8) Pj d{S)) < exp ( - Q(5 2 Pi d(£)))- 

Then there exists a constant iVi depending only on 5 and 7 such that, 

P(3S,j G J,s > jn,q h ^(S) < (1 - 6/8) P jd(S)) 

< w2 n exp ( - fi(5 3 rf(5))) < iw (2exp(-fi(<5 3 7£;)))™ = o(l). 

Note that the inequality holds because \J\ < w, the number of sets S with |,S| > ■yn is at most 
2 n , 5/8w < pj < 1 for all j G J and d(S') > (fe + l)^! > kjn. It follows that a.a.s. there exists 
no set S with \S\ > for which there exists j G J such that qh-j(S) < (1 — 8/8)pjd(S). 
Lemma 15.71 then follows. I 

Recall that p(S) is the number of hyperedges partially contained in S and v(S) is the 
number of hyperedges intersecting S by the definition above Lemma 14.71 
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Lemma 5.8 Let 5 > be any constant and let fi = fi(G) = D/n as defined in h5.1\) . Then 
there exists a constant N > depending only on 5 such that, provided k > N, a.a.s. there 
exists no S C V(G) for which log 2 77, < IS*! < n, d(S) < (1 — 5)fi\S\, and v(S) < k\S\. 

This lemma will be proved after the proof of Theorem 12.61 

Proof of Theorem 12.61 By Lemma 14.61 and 14.71 it is enough to show that the expected 
number of sets S contained in a hypergraph G G M. (n, m, k + 1) with property -4(7) for which 
all of Lemma [4.71 (i)-(iv) are satisfied is o(l). We call a set S C V(G) is interesting if it lies 
in a hypergraph G with property -4(7). Let X be the number of interesting sets S C V(G) 
such that (14.61) holds. Similarly, let X <a (or X>{, or X^j) for any < a < b < n denote the 
number of interesting S C [n] such that (I4.6[) holds and \S\ < a (or IS*! > b or a < \S\ < b) 
respectively. For any set S under discussion, let s denote \S\ and s denote \S\. 
Case 1: s < en/k. By theorem's hypothesis 

w-l \ 

^2(w - j)m h -j J - kn < -en, 

3=0 J 

any S satisfying (14. 6 p must satisfy 

d*(S) < ks-en. (5.10) 

When s < en/k, ks — en < 0. However d*(S) > as observed below (I4.4p . Hence (15.101) 
cannot hold. Thus X <en / k = 0. 

Case 2: s > (1 — 7)72. part (i) of Lemma [4.71 says that (14. 6 p holds only if the number of 
hyperedges partially contained in S is at least ks/w. But X counts only interesting sets, i.e. 
sets that lie in a hypergraph with property ^.(7). By the definition of property -4(7), there 
are no such interesting sets and so X>(i_ 7 ) n = 0. 

Case 3: en/k < s < 772. Let S± = {h — w)/2h. By Lemma [5. 8[ there exists N\ > such 
that provided k > Ni, the expected number of S with d(S) < (1 — 5i)fxs for which Lemma H~T1 
(ii) is satisfied and en/k < s < n is o(l). We now show that there exists no interesting 
sets S C V{G) with \S\ < 772 for which Lemma H~T1 (iii) holds and d(S) > (1 — 5i)/is. If 
d(S) > (1 - S t )fjLS, d(S) > ^ks provided k>h + w since fM>h(k- l)/w by Q- Then it 
follows that 

d(S) — ks ks 
h — w ~ 2w 

Lemma [4.71 (iii) implies that (14. 6 p holds only if the number of hyperedges partially contained 
in S is at least ks/2w. By the definition of property -4(7), there is no such interesting sets S 
when s < •yn. So provided k > maxjA^x, h + w}, a.a.s. there exists no interesting sets S, with 
s < 772 for which both Lemma [4.71 (ii) and (iii) hold. Then E(X <7 „) = o(l). 

Note that k\S\/2w in the definition of property -4(7) can be modified to be Ck\S\ for any 
positive constant C, and it can be checked straightforwardly that there exists a constant 7 
depending on C only, such that Corollary 14.31 holds. Therefore, any 0<<5i<l — w/h would 
work here by choosing some appropriate C to modify the definition of property -4(7). 
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Case 4-' 1^ < s < (1 — j)n. Let < 5 2 < 1 be chosen later. By Lemma [5.7[ there exists 
N 2 > depending only on S 2 such that provided k > N 2 , a.a.s. 



w— 1 



-(fa-i > (1 — 5 2 )fcs for all S with 772 < \S\ < (1 — 7)71. 

3=0 J 

For any S* C ^(G), let 77 = ^(S") be as defined in (14. 5p . Then by Lemma 14.71 to show 
E(X[ 7 „ 5 (i_ 7 ) n ]) = o(l), it is enough to show that the expected number of sets S with •yn < s < 
(1 —i)n for which r](S) is at most h 2 S 2 ks, is o(l). Consider the probability space Ai(n, m, 0), 
which is generated by placing each hyperedge uniformly and randomly on the n vertices. Let 
B be the event that all bins contain at least k+1 balls. Then Ai(n, m, k+1) equals Ai(n, m, 0) 
conditioned on the event B. By Lemma 15721 P(B) = fi(/ fe+1 (A) n ) where A/^A) = fifk+i(X). 
Given any set S, let r = s/n. For any hyperedge of size h—j, the probability for it to intersect 
both S and S is p jjT = 1 - r h ^ - (1 - r) h ' j . Then p jjT > 1 - 7^' - (1 - 7)^' > 1 - 7 /l "™+ 1 - 
(1 — ^ h ~ w + l for any set S and any < j < to — 1. Recall from ( 15 .ip that m is the total 
number of hyperedges in G. Then Ei](S) = YlJ^o Pj,r m h-j > 777.(1 — ^h-w+i _ ^ _ ^/i-m+i^ 
for any given S. Since m> (k — l)n/w by (15. 3p . 

E77OS) > (l-i h - w+1 -(l-i) h - w+1 ){k-l)n/w = 9 7 (A>, for any 5 with 777 < |S| < (1-7)71. 
Choose 

J _ ryh-' w+l _ M _ ^\h-w+l 

62 = Awh 2 {l -7) ' 

Then 5 2 depends only on 7 and so also depends only on 7. By the Chernoff bound [7], 



P(V(S) < h 2 5 2 ks) < P(t}(S) < h%k(l-j)n) < P ^rj(S) < -Er/(S) j < exp(-Er/(5')/16). 

Note that the second inequality holds because of the choice of 5 2 . So there exists some constant 
C > s.t. 

P{r)(S) < h%ks I B) < Cexp (-B V (S)/16) f k+1 (\y n = C (exp [~^- ~ hi/ fe+1 (A) 

The number of sets 5* with 777 < \S\ < (1 — 7)71 is at most 2™. So the expected number of 
sets 5* with jn < s < (1 — 7)71 and //(S 1 ) < h 2 5 2 ks in .M(?i, m, k + 1) is at most 

c(2exp f-^l-lnf k+1 (X) 

Clearly /fc + i(A) — > 1 as k — >• 00 and Er^S 1 ) = 7 (/c)77 as observed before. Then there exists a 
constant > depending only on 7 such that provided k > N3, 

2exp(-^-lnA +1 (A))<l. 
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Then provided k > max{A^2, N 3 }, 

E(X[ 7nj (x_ 7 ) n ]) = o(l). 

Combining all cases, let N = max{iy 1; N 2 , N 3 , h + w}. Then N depends only on 7. We 
have shown that provided k > N, EX = o(l). Then Theorem 12.61 follows. I 

Proof of Lemma 15.81 The idea of the proof is as follows. When S is big, by Corollary 15.51 
there are no such sets with d(S) < (1 — 5)fi\S\. We will see later that v(S) < k\S\ requires 
a lot of hyperedges partially contained in S, which is unlikely to happen when S is small 
enough. 

Let G G M.{n, m, k + 1). Let D = Y^J=o(h — j)m h _j and fi = D/n as defined in ( 15. ip . For 
any S, let p(S, i) denotes the number of hyperedges with exactly i vertices contained in S. 
Then u(S) < ks if and only if 52 i=2 (« ~ 1) yo(*S f , i) > d(S) — ks. By Corollary 15. 5[ there exists 
Ni > depending only on S such that provided k > N\, a.a.s. there is no S such that s > n/h 
and d(S) < (1 — 5)fis. So we only need to consider sets S with \S\ < n/h. We call a vertex 
set S E G bad if log 2 n < \S\ < n/h, d(S) < (l-5)fi\S\ and ^f =2 (i - 1)^, i) > d(5)-A;|5|. 
Let s denote \S\. 

For any given S, let p(S') denote the probability of S being bad. By Corollary I5.4[ there 
exists N 2 > and < a < 1, both depending only on 5, such that provided k > N 2 , the 
probability that d(S') < (1 — 5)us is at most a^ s . Let p(q,t) be the probability that that 
Sj =2 (* — l)p(»S', i) is at least t conditional on d(S) = q. Then 

P(S)= P (q,q-ks)P(d(S) = q). (5.11) 

(k+l)s<q<(l-S)fj,s 

For the small value of q (or s), we need the following claim, to be proved later. 
Claim 5.9 If q < D/h, then 

hlnt\ ehih - l) 2 q 2 ' ' 
AW 



p(q, t) < [exp 
In particular, ift — > 00 as n — > 00, then 



,'eh 3 q 2 



AtD 

Case 1: s < 2n/eh 3 (k+ 1). Since (k + l)s < q < (1 — 8)/j,s < D/h, we have 
1 / (A;+l)s _, in 9 / (l-5)/xs ^ s _ 



< — r — = k + 1, — < < — , q — ks > s > log n. 



q — ks (k + l)s — ks D fin n 

So q — ks — > 00 as n — > 00. By 05. 11 j) and the particular case of Claim |5T9| we have 

p(s) < E f^)'"^^) 



(fc+l)s<9<(l-(5)/is 



< t ^^ j Pfa + l) g <^)<(l-% g )<( efe(fe 4 + 1)g l a- 
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Note that the second inequality above holds because q — ks > s and < eh 3 {k + l)s/4n < 1 
since s < 2n/eh 3 {k + 1). 

Then the expected number of bad sets S with \S\ = s, for any fixed log 2 n < s < 
2n/eh 3 (k + 1), is at most 



n 



eh 3 (k + l)s 



s J \ An 



cT < • of ■ eh3(k ^ 1)S ) S = (e^(k + l)^/4)« 



Since \i > h(k — l)/w by (I5.3p . this is at most exp(— s) provided k > N3 for some A^3 > 
depending only on a. 

Case 2: s > 2n/eh 3 (k + 1). Take p(q, q — ks) < 1 since p(q, q — ks) is a probability. So 
the expected number of bad sets S with \S\ = s, for any fixed 2n/eh 3 (k + 1) < s < (1 — S)fis, 



is at most 

n 
s 



{^J^y < {e 2 h\k + 1)^/2) s < exp(-s) 



whenever k > N4 for some A^4 > depending only on a. Since a depends only on S, N3 and 
A^ 4 also depend only on 6. Let iV = max{iVi, N 2 , N 3 , N4}. Then iV depends only on 5 and 
provided k > N, the expected number of bad S is at most 

Yl exp(-s) = o(l). 

log 2 n<s<n/h 

Lemma 15.81 follows. I 

It only remains to prove Claim 15.91 

Proof of Claim 15.91 To illustrate the method of computing p(q,t), we show in detail the 
case h = 2 first. Conditional on that d(S) = q, we want to estimate the probability that there 
are at least t edges in S. Consider the alternative algorithm that generates the probability 
space of the partition-allocation model [777.2], 0, k + 1). Fix any allocation which allocates 

q balls into bins representing vertices in S with each bin containing at least k + 1 balls. There 
are at most 

2t) 2H\ 

partial partitions that contain t parts within S. The probability of every such partial partition 
to occur is 

t-i 



D - 1 - 2i 

i=0 



So 



which is at most 



i=0 



[<l] t j-j- q-t-i < (eqy ( q-t ^* < (iq <j " 1 



2H\ LL D — 1 — 2i ~ \2tJ \D-1 ~ \2t D 

i=0 
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Note that the second inequality holds since q < D/2 and so q — t < (D — l)/2. 

Now we estimate p(q,t) in the general case h>2. Let M = ([2m 2 ], . . . , [hm^]). Consider 
the alternative algorithm that generates the probability space of the partition-allocation model 
P([n], M, 0, k + 1), defined in Section[3j Fix any allocation that allocates exactly q balls into S 
with each bin containing at least k+1 balls. The algorithm uniformly randomly partitions balls 
into parts such that there are exactly rrih-j parts with size h — jiorj = 0, . . . , w — 1. Let U = 
{(u 2 , ...,u h )e W^" 1 ) : £(L 2 (i - l)ui = t}. Let u = (u 2 , • • • , u h ) be an arbitrary vector from 
U. We over estimate the probability that p(S, i) is at least Ui for all % = 2, . . . , h, conditional 
on d(S) = q. Let p(q, u) denote this probability. Then clearly p(q,t) < J2 ue uP(q,u). The 
number of partial partitions that contain Ui partial parts of size i within S is 

q \ (2u 2 )\ (hu h )\ 

K Ui, 2u 2 , 3u 3 , ...,hu h J 2\ u *u 2 \ h\ u h Uh \ ' 

where u\ = q — Yli=2 ^ u i- For an y suc h partial partition we compute the probability that it 
occurs. The algorithm starts from picking a ball v unpartitioned in S and then it chooses at 
most h — 1 balls that are u.a.r. chosen from all the unpartitioned balls to be partitioned into 
the part containing v. 

The probability of the occurrence of a given u 2 partial parts of size 2 within S is at most 
1 1 1 1 



(5.12) 



D-l-hi y ' D-l D-h-l D - 1 - h(u 2 - 1) 

i=0 ' 

The probability of the occurrence of a given u 3 partial parts of size 3 within S is at most 

TT ( h ~ l ] - - <(h- D 2 " 3 TT - 

J-l I 2 D-hu 2 -hi-l D-hu 2 -hi-2~ y ' 11 (D-hu 2 -hi-l) 2 ' 

Note that the above inequality holds because h^ i=2 Ui < hq/2 < D/2. Keeping the analysis 
in this procedure, we obtain that the probability of a particular partial partition with 
partial parts of size i within S is at most 

U2— 1 -, U3 — 1 . 



(h _ i)u 2 +2u 3 +.-+{h-l)u h x TT 1 TT 1 

v ; 11 D - hi - 1 11 (D - hu 2 - hi 



i=0 i=0 

X • • • X 



i= o (D-hJ2^ Uj -hi-l)^ H (D-hZ-Zluj-hi-l)^ 
The product of this and (15.121) gives an upper bound of p(q, u), which is at most 

[glgk^Z^ n 1 TT - 

u 2 \u 3 \ ■ ■ ■ u h \2\<* ■ ■ ■ h\«h\ f = \ D - hi - 1 11 (D — h Y^Zl uj -hi-l] 
Since 2\ U2 ■ ■ ■ h\ Uh \ > 2\ this is at most 



h-l 



11 n _ : _ i 11 



i=2 "3 



Ui — 2 



n 2 ! M3 ! • • ■ u h W LI D- hi-l f = l Q (D-h ^ ^ - hi - 
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Since q < D/h and so q — t < (D — l)/h, this is at most 

(eq(h-l)/2Y f q-tY 2 f q - t - u 2 \ u * ( X "" 



= (eqjh - l)/2)« 

«2 2 • • • < h (q - 1 - u 2 ) u * ■■■(q-t- Y^Zl uj)^- 2 ^ 

q-t\ ( q-t-va \ ( q-t- }_^ j=2 uj 



x 



< (eg(fr-l)/2) f ( q-t 



u?(u 3 (q - t - U2 ))«3 ■ ■ ■ - t - Yl;Zl u v 

< (eg(/i - l)/2)< ^ g n ' 



„.1«k V D — 1 



Since q > J2i=2 iu i and t = SiU^ ~~ q-t - Y!j=2 u i - £j=i+i M i > M m for a11 
2 < z < /i — 1, and so 

_ t _ U2 ))«a . . . ( u ^ g _ t _ J2 Uj ) ) h > ufuf* ■ ■ ■ u[ h ~ 1)Uh . 

3=2 

We prove the following claim later. 
Claim 5.10 Let t = X^ =2 (j — ^) u j- Then 

u 2 2u 3 {h-l)u h ^ ( 2t \ 

By Claim I5U0| for any /i > 2, 

Since \U\ < t h , we have 

, , h feh(h-W\ t ( fh\nt\ eh(h-l) 2 q 2 V 

In particular, if t — > oo as n — > oo, then h\nt/t — > and so exp(/i In i/i) < — l)) 2 

provided n is large enough. So 
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Proof of Claim 15.101 We solve the following optimization problem 

(Pi) min m™ 2 ml m3 ■■■mf- 1)mh 

s.t. m 2 + 2m 3 + • • • + (h — l)m/j = t 
m 2 ,m 3 , ...,m h >0 

Letting Xi = (i — l)m, for 2 < i < h, and taking the logarithm of the objective function, 
(Pi) is equivalent to the following optimization problem. 

(P 2 ) min x 2 \nx 2 + x 3 \n(x 3 /2) -\ \- x h \n(x h /(h - 1)) 

s.t. x 2 + x 3 H h x h = t 

x 2 ,x 3 , ...,x h >0 

For convention, let x In x = if x = 0. Applying the Lagrange multiplier yields x* = 
(x 2 , £3, . . . , x* h ) with Xi = 2t{i — l)/h{h — 1), which is a feasible solution of (P 2 ). In order to 
show that this is an optimal solution, we need to show that the optimal solution does not 
appear on the boundary. 

Let x be any solution on the boundary of (P 2 ). Then there exists 2 < i < h such that 
Xi = 0. There also exists j with Xj > 0. Consider x' with x\ = (i — l)xj/h, x'j = Xj — (i — l)xj/h 
and for any I ^ Then x' is feasible and it is straightforward to check that 

x[hi(x'J(i — 1)) + x'j\xi(x'j/(j — 1)) < Xi\n(xi/(i — 1)) + Xj\n(xj/(j — 1)). 

Hence x' cannot be an optimal solution. This proves that x* is the minimizer and so the 
optimal value of (Pi) is exp(t ln(2t/(h(h - 1)))) = (2t/(h(h - 1)))*. I 
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